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APPARATUS FOR DECOUPLING IN HIGH 
PERFORMANCE COMMUNICATION BETWEEN 
SOFTWARE PROCESSES 

BACKGROUND OF THE INVENTION 
5 The invention pertains to the field of decoupled 

information exchange between software processes running on 
different or even the same computer where the software 
processes may use different formats for data representa- 
tion and organization or may use the same formats and 
10 organization but said formats and organization may later 
be changed without requiring any reprogramming. Also, the 
software processes use "semantic" or field-name informa- 
tion in such a way that each process can understand and 
use data it has received from any foreign software proc- 
15 ess, regardless of semantic or field name differences* 
The semantic information is decoupled from data 
representation and organization information. 

With the proliferation of different types of 
computers and software programs and the ever-present need 
20 for different types of computers running different types 
of software programs to exchange data, there has arisen a 
need for a system by which such exchanges of data can oc- 
cur. Typically , data that must be exchanged between 
software modules that are foreign to each other comprises 
25 text, data and graphics. However, there occasionally 
arises the need to exchange digitized voice or digitized 
image data or other more exotic forms of information. 
These different types of data are called "primitives." A 
software program can manipulate only the primitives that 
30 it is programmed to understand and manipulate, other 
types of primitives, when introduced as data into a 
software program, will cause errors. 

"Foreign," as the term is used herein, means that the 
software modules or host computers involved in the 
35 exchange "speak different languages." For example, the 
Motorola and Intel microprocessor widely used in personal 
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computers and work stations use different data representa- 
tions in that in one family of microprocessors the most 
signif icant byte of multibyte words is placed first while 
in the other family of processors the most significant 
5 byte is placed last. Further, in IBM computers text let- 
ters are coded in EBCDIC code while in almost all other 
computers text letters are coded in ASCII code* Also, 
there are several different ways of representing numbers 
including integer, floating point, etc. Further, foreign 

10 software modules use different ways of organizing data and 
use different semantic information, i.e., what each field 
in a data record is named and what it means. 

The use of these various formats for data 
representation and organization means that translations 

15 either to a common language or from the language of one 
computer or process to the language of another computer or 
process must be made before meaningful communication can 
take place. Further, many software modules between which 
communication is to take place reside on different comput- 

20 ers that are physically distant from each other and con- 
nected only local area networks, wide area networks, 
gateways, satellites, etc. These various networks have 
their own widely diverse protocols for communication. 
Also, at least in the world of financial services, the 

25 various sources of raw data such as Dow Jones News or 
Telerate 1 " use different data formats and communication 
protocols which must be understood and followed to receive 
dav.a from these sources. 

, In complex data situations such as financial data 

30 regarding equities, bonds, money markets, etc., it is 
often useful to have nesting of data. That is, data 
regarding a particular subject is often organized as a 
data record having multiple "fields," each field pertain- 
ing to a different aspect of the subject. It is often 

35 useful to allow a particular field to have subfields and a 
particular subf ield to have its own subfields and so on 
for as many levels as necessary. For purposes of discus- 
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sion herein, this type of data organization is called 
"nesting." The names of the fields and what they mean 
relative to the subject will be called the "semantic 
information" for purposes of discussion herein. The 
5 actual data representation for a particular field, i.e., 
floating point, integer, alphanumeric, etc., and the 
organization of the data record in terms of how many 
fields it has, which are primitive fields which contain 
only data, and which are nested fields which contain 
10 subfields, is called the "format" or "type" information 
for purposes of discussion herein. A field which contains 
only data (and has no nested subfields) will be called a 
"primitive field," and a field which contains other fields 
will be called a "constructed field" herein. 
15 There are two basic types of operations that can 

occur in exchanges of data between software modules. The 
first type of operation is called a "format operation" and 
involves conversion of the format of one data record 
(hereafter data records may sometimes be called "a forms") 
20 to another format. An example of such a format operation 
might be conversion of data records with floating point 
and EBCDIC fields to data records having the packed 
representation needed for transmission over an ETHERNET*" 
local area network. At the receiving process end another 
25 format operation for conversion from the ETHERNET" 1 packet 
format to integer and ASCII fields at the receiving proc- 
ess or software module might occur. Another type of 
operation will be called herein a "semantic-dependent 
operation" because it requires access to the semantic 
30 information as well as to the type or format information 
about a form to do some work on the form such as to supply 
a particular field of that form, e.g., today's IBM stock 
price or yesterday's IBM low price, to some software 
module that is requesting same. 
35 Still further, in today's environment, there are 

often multiple sources of different types of data and/ or 
multiple sources of the same type of data where the 
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sources overlap in cc erage but use different formats and 
different communication protocols (or even overlap with 
the same format and the same communication protocol) . It 
is useful for a software module (software modules may 
5 hereafter be sometimes referred to as "applications") to 
be able to obtain information regarding a particular 
subject without knowing the network address of the service 
that provides information of that type and without knowing 
the details of the particular communication protocol 

10 needed to communicate with that information source. 

A need has arisen therefore for a communication 
system which can provide an interface between diverse 
software modules, processes and computers for reliable, 
meaningful exchanges of data while "decoupling" these 

15 software modules and computers, "Decoupling" means that 
the software module programmer can access information from 
other computers or software processes without knowing 
where the other software modules and computers are in a 
network, the format that forms and data take on the 

20 foreign software, what communication protocols are neces- 
sary to communicate with the foreign software modules or 
computers, or what communication protocols are used to 
transit any networks between the source process and the 
destination process; and without knowing which of a 

25 multiple of sources of raw data can supply the requested 
data. Further, "decoupling," as the term is used herein, 
means that data can be requested at one time and supplied 
at another and that one process may obtain desired data 
from the instances of forms created with foreign format 

30 and foreign semantic data through the exercise by a com- 
munication interface of appropriate semantic operatic <s to 
extract the request d data from the foreign forms wit he 
extraction process being transparent to the requesting 
process. 

35 Various systems exist in the prior art to allow 

information exchange between foreign software modules with 
various degrees of decoupling. One such type of system is 
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any electronic mail software which implements Electronic 
Document Exchange Standards including CCITT's X.409 
standard. Electronic mail software decouples applications 
in the sense that format or type data is included within 
5 each instance of a data record or form. However, there 
are no provisions for recording or processing of semantic 
information. Semantic operations such as extraction or 
translation of data based upon the name or meaning of the 
desired field in the foreign data structure is therefore 

10 impossible. Semantic-Dependent Operations are very 
important if successful communication is to occur. 
Further, there is no provision in Electronic Mail Software 
by which subject -based addressing can be implemented 
wherein the requesting application simply asks for 

15 information by subject without knowing the address of the 
source of information of that type. Further, such 
software cannot access a service or network for which a 
communication protocol has not already been established. 
Relational Database Software and Data Dictionaries 

20 are another example of software systems in the prior art 
for allowing foreign processes to share data. The 
shortcoming of this class of software is that such 
programs can handle only "flat" tables, records and fields 
within records but not nested records within records. 

25 Further, the above-noted shortcoming in Electronic Mail 
Software also exists in Relational Database Software. 

SUMMARY OF THE INVENTION 

According to the teachings of the invention, there is 
provided a method and apparatus for providing a structure 
30 to interface foreign processes and computers while 
providing a degree of decoupling heretofore unknown. 

The data communication interface software system 
according to the teachings of the invention consists es- 
sentially of several libraries of programs organized into 
35 two major components, a communication component and a 
data-exchange component. Interface, as the term is used 
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herein in the context of the invention, means a collection 
of functions which may be invoked by the application to do 
useful work in communicating with a foreign process or a 
foreign computer or both. Invoking functions of the 
5 interface may be by subroutine calls from the application 
or from another component in the communications interface 
according to the invention. 

In the preferred embodiment, the functions of the 
interface are carried out by the various subroutines in 
10 the libraries of subroutines which together comprise the 
interface. Of course, those skilled in the art will 
appreciate that separate programs or modules may be used 
instead of subroutines arid may actually be preferable in 
some cases. 

15 Data format decoupling is provided such that a first 

process using data records or forms having a first format 
can communicate with a second process which has data 
records having a second, different format without the need 
for the first process to know or be able to deal with the 

20 format used by the second process. This form of 

decoupling is implemented via the data-exchange component 
of the communication interface software system. 

The data-exchange component of the communication 
interface according to the teachings of the invention 

25 includes a forms -manager module and a forms-class manager 
module. The forms-manager module handles the creation, 
storage, recall and destruction of instances of forms and 
calls to the various functions of the forms-class manager. 
The latter handles the creation, storage, recall, 

30 interpretation, and destruction of forms-class descriptors 
which are data records which record the format and 
semantic information that pertain to particular classes of 
forms. The forms-class manager can also receive requests 
from the application or another component of the com- 

35 munication interface to get a particular field of an 

instance of a form when identified by the name or meaning 
of the field, retrieve the appropriate form instance, and 
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extract and deliver the requested data in the appropriate 
field* The forms-class manager can also locate the class 
definition of an unknown class of forms by looking in a 
known repository of such class definitions or by request- 
5 ing the class definition from the forms-class manager 

linked to the foreign process which created the new class 
of form. Semantic data, such as field names, is decoupled 
from data representation and organization in the sense 
that semantic information contains no information regard- 
10 ing data representation or organization. The communica- 
tion interface of the invention implements data decoupling 
in the semantic sense and in the data format sense. In 
the semantic sense, decoupling is implemented by virtue of 
the ability to carry out semantic-dependent operations. 
15 These operations allow any process coupled to the com- 
munications interface to exchange data with any other 
process which has data organized either the same or in a 
different manner by using the same field names for data 
which means the same thing in the preferred embodiment. 
20 In an alternative embodiment semantic-dependent operations 
implement an aliasing or synonym conversion facility 
whereby incoming data fields having different names but 
which mean a certain thing are either relabeled with field 
names understood by the requesting process or are used as 
25 if they had been so relabeled. 

The interface according to the teachings of the 
invention has a process architecture organized in 3 
layers. 

Architectural decoupling is provided by an 
30 information layer such that a requesting process can 
request data regarding a particular subject without 
knowing the network address of the server or process where 
the data may be found. This form of decoupling is 
provided by a subject-based addressing system within the 
35 information layer of the communication component of the 
interface. 

Subject-based addressing is implemented by the 
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communication component of the communication interface of 
the invention by subject mapping. The communication 
component receives "subscribe" requests from an ap- 
plication which specifies the subject upon which data is 
5 requested. A subject-mapper module in the information 
layer receives the request from the application and then 
looks up the subject in a database, table or the like. 
The database stores "service records" which indicate the 
various server processes that supply data on various 

10 subjects. The appropriate service record identifying the 
particular server process that can supply data of the 
requested type and the communication protocol (hereafter 
sometimes called the service discipline) to use in 
communicating with the identified server process is 

15 returned to the subject-mapper module. 

The subject mapper has access to a plurality of 
communications library programs or subroutines on the 
second layer of the process architecture called the 
service layer. The routines on the service layer are 

20 called "service disciplines." Each service discipline 
encapsulates a predefined communication protocol which is 
specific to a server process. The subject mapper then 
invokes the appropriate service discipline identified in 
the service record. 

25 The service discipline is given the subject by the 

subject mapper and proceeds to establish communications 
with the appropriate server process.* Thereafter , 
instances of forms containing data regarding the subject 
are sent by the server process to the requesting process 

30 via the service discipline which established the com- 
munication. 

arvice protocol decoupling is provided by the 
service layer. 

Temporal decoupling is implemented in some service 
35 disciplines directed to page-oriented server processes 
such as Telerate*" by access to real-time data bases which 
store updates to pages to which subscriptions are 
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outstanding. 

A third layer of the distributed communication 
component is called the communication layer and provides 
configuration decoupling. This layer includes a DCC 
5 library of programs that receives requests to establish 
data links to particular server and determines the best 
communication protocol to use for the link unless the 
protocol to use for the link unless the protocol is 
already established by the request. The communication 

10 layer also includes protocol engines to encapsulate 

various communication protocols such as point-to-point, 
broadcast, reliable broadcast and the Intelligent 
Multicast™ protocol. Some of the functionality of the 
communication layer augments the functionality of the 

15 standard transport protocols of the operating system and 
provides value added services. 

One of these value added services is the reliable 
broadcast protocol. This protocol engine adds sequence 
numbers to packets of packet i zed messages on the transmit 

20 side and verifies that all packets have been received on 
the receive side. Packets are stored for retransmission 
on the transmit side. On the receive side, if all packets 
did not come in or some are garbled, a request is sent for 
retransmission. The bad or missing packets are then 

25 resent. When all packets have been successfully received, 
an acknowledgment message is sent. This causes the 
transmit side protocol engine to flush the packets out of 
the retransmit buffer to make room for packets of the next 
message. 

30 Another value added service is the Intelligent 

Multicast Protocol. This protocol involves the service 
discipline examining the subject of a message to be sent 
and determining how many subscribers there are for this 
message subject, if the number of subscribers is below a 

35 threshold set by determining costs of point-to-point 

versus broadcast transmission, the message is sent point- 
to-point. Otherwise the message is sent by the reliable 
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broadcast protocol. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a block diagram illustrating the 
relationships of the various software modules of the com- 
5 munication interface of one embodiment of the invention to 
client applications and the network. 

Figure 2 is an example of a form-class definition of 
the constructed variety. 

Figure 3 is an example of another constructed 
10 form-class definition. 

Figure 4 is an example of a constructed form-class 
definition containing fields that are themselves 
constructed forms. Hence, this is an example of nesting. 
Figure 5 is an example of three primitive form 
15 classes. 

Figure 6 is an example of a typical form instance as 
it is stored in memory. 

Figure 7 illustrates the partitioning of semantic 
data, format data, and actual or value data between the 
20 form-class definition and the form instance. 

Figure 8 is a flow chart of processing during a 
format operation. 

Figure 9 is a target format-specific table for use in 
format operations. 
25 Figure 10 is another target format-specific table for 

use in format operations. 

Figure 11 is an example of a general conversion table 
for use in format operations. 

Figure 12 is a flow chart for a typical 
30 semantic-dependent operation. 

Figures 13 A and 13B are, respectively, a class 
definition and the class descriptor form which stores this 
class definition. 

Figure 14 is a block diagram illustrating the 
35 relationships between the subject-mapper module and the 
service discipline modules of the communication cpmponent 
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to the requesting application and the service for 
subject-based addressing. 

Figure 15 illustrates the relationship of the various 
modules, libraries and interfaces of an alternative 
5 embodiment of the invention to the client applications. 

Figure 16 illustrates the relationships of various 
modules inside the communication interface of an 
alternative embodiment. 

Figure 17 is a block diagram of a typical distributed 
10 computer network. 

Figure 18 is a process architecture showing the 
relationship of the DCC library to the DCC protocol 
engines in the daemon. 

Figure 19, comprised of Figures 19A and 19B, is a 
15 flow diagram of the process which occurs, inter alia, at 
the three layers of the software of the invention where a 
subscribe request is sent to a service. 

Figure 20, comprised of Figures 2 OA and 2 OB, is a 
flow chart of the process which occurs at, inter alia, the 
20 three layers of the software interface according to the 
teachings of the invention when a subscribe request is 
received at a data producing process and messages flow 
back to the subscribing process. 

Figure 21, comprised of Figures 21A and 2 IB, is a 
25 flow chart of the process which occurs at the DCC library 
and in the reliable broadcast protocol engine when 
messages are sent by the reliable broadcast protocol. 

Figure 22, comprised of Figures 22A and 22B, is a 
flow chart of processing by a reliable broadcast protocol 
30 engine on the data consumer side of the reliable broadcast 
transaction. 

Figure 23 is a flow chart of the processing which 
occurs in the service discipline to implement the 
Intelligent Multicast*" protocol. 
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DETAILED DESCRIPTION OF THE PREFERRED AND ALTERNATIVE 
EMBODIMENTS 

Since the following description is highly technical, 
it can best be understood by an understanding of the terms 
5 used in the digital network telecommunication art defined 
in the appended glossary. The reader is urged to read 
this glossary first. 

Referring to Figure l there is shown a block diagram 
of a typical system in which the communications interface 

10 of the invention could be incorporated, although a wide 
variety of system architectures can benefit from the 
teachings of the invention. The communication interface 
of the invention may be sometimes hereafter referred to as 
the TIB™ or Teknekron Information Bus in the specification 

15 of an alternative embodiment given below. The reader is 
urged at this point to study the glossary of terms 
included in this specification to obtain a basic 
understanding of some of the more important terms used 
herein to describe the invention. The teachings of the 

20 invention are incorporated in several libraries of 

computer programs which, taken together, provide a com- 
munication interface having many functional capabilities 
which facilitate modularity in client application develop- 
ment and changes in network communication or service com- 

25 munication protocols by coupling of various client ap- 
plications together in a "decoupled" fashion. Hereafter, 
the teachings of the invention will be referred to as the 
communication interface. "Decoupling," as the term is 
used herein, means that the programmer of client applica- 

30 tion is freed of the necessity to know the details of the 
communication protocols, data representation format and 
data record organization of all the other applications or 
services with which data exchanges are desired. Further, 
the programmer of the client application need not know the 

35 location of services or servers providing data on 

particular subjects in order to be able to obtain data on 
these subjects. The communication interface automatically 
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takes care of all the details in data exchanges between 
client applications and between data-consumer applications 
and data-provider services. 

The system shown in Figure 1 is a typical network 
5 coupling multiple host computers via a network or by 

shared memory. Two host computers, io and 12, are shown 
in Figure 1 running two client applications 16 and 18, 
although in other embodiments these two client ap- 
plications may be running on the same computer. These 
10 host computers are coupled by a network 14 which may be 
any of the known networks such as the ETHERNET™ communica- 
tion protocol, the token ring protocol, etc. A network 
for exchanging data is not required to practice the 
invention, as any method of exchanging data known in the 
15 prior art will suffice for purposes of practicing the 
invention. Accordingly, shared memory files or shared 
distributed storage to which the host computers 10 and 12 
have equal access will also suffice as the environment in 
which the teachings of the invention are applicable. 
20 Each of the host computers 10 and 12 has random 

access memory and bulk memory such as disk or tape drives 
associated therewith (not shown) . Stored in these 
memories are the various operating system programs, client 
application programs, and other programs such as the 
25 programs in the libraries that together comprise the com- 
munication interface which cause the host computers to 
perform useful work. The libraries of programs in the 
communication interface provide basic tools which may be 
called upon by client applications to do such things as 
30 find the location of services that provide data on a 

particular subject and establish communications with that 
service using the appropriate communication protocol. 

Each of the host computers may also be coupled to 
user interface devices such as terminals, printers, etc. 
35 (not shown) . 

In the exemplary system shown in Figure 1, host 
computer 10 has stored in its memory a client application 
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program 16. Assume that this client application program 
16 requires exchanges of data with another client applica- 
tion program or service 18 controlling host computer 12 in 
order to do useful work. Assume also that the host 
5 computers 10 and 12 use different formats for representa- 
tion of data and that application programs 16 and 18 also 
use different formats for data representation and 
organization for the data records created thereby. These 
data records will usually be referred to herein as forms. 

10 Assume also that the data path 14 between the host comput- 
ers 10 and 12 is comprised of a local area network of the 
ETHERNET™ variety. 

Each of the host processors 10 and 12 is also 
programmed with a library of programs, which together 

15 comprise the communication interfaces 20 and 22, 

respectively. The communication interface programs are 
either linked to the compiled code of the client applica- 
tions by a linker to generate run time code, or the source 
code of the communication programs is included with the 

20 source code of the client application programs prior to 
compiling. In any event, the communication library 
programs are somehow bound to the client application. 
Thus, if host computer 10 was running two client applica- 
tions, each client application would be bound to a com- 

25 munication interface module such as module 20. 

The purpose of the communications interface module 20 
is to decouple application 16 from the details of the data 
format and organization of data in forms used by 
application 18, the network address of application 18, and 

30 the details of the communication protocol used by ap- 
plication 18, as well as the details of the data format 
and organization and communication protocol necessary to 
send data across network 14. Communication interface 
module 22 serves the same function for application 18, 

35 thereby freeing it from the need to know many details 
about the application 16 and the network 14. The com- 
munication interface modules facilitate modularity in that 
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changes can be made in client applications, data formats 
or organizations, host computers, or the networks used to 
couple all of the above together without the need for 
these changes to ripple throughout the system to ensure 
5 continued compatibility. 

In order to implement some of these functions, the 
communications interfaces 20 and 22 have access via the 
network 14 to a network file system 24 which includes a 
subject table 26 and a service table 28. These tables 
10 will be discussed in more detail below with reference to 
the discussion of subject-based addressing. These tables 
list the network addresses of services that provide 
information on various subjects. 

A typical system model in which the communication 
15 interface is used consists of users, users groups, 

networks, services, service instances (or servers) and 
subjects. Users, representing human end users, are 
identified by a user-ID. The user ID used in the com- 
munications interface is normally the same as the user ID 
20 or log-on ID used by the underlying operating system (not 
shown) . However, this need not be the case. Each user is 
a member of exactly one group. 

Groups are comprised of users with similar service 
access patterns and access rights. Access rights to a 
25 service or system object are grantable at the level of 
users and at the level of groups. The system 
administrator is responsible for assigning users to 
groups . 

A "network," as the term is used herein, means the 
30 underlying "transport layer" (as the term is used in the 
ISO network layer model) and all layers beneath the 
transport layer in the ISO network model. An application 
can send or receive data across any of the networks to 
which its host computer is attached. 
35 The communication interface according to the 

teachings of the invention, of which blocks 20 and 22 in 
Figure 1 are exemplary, includes for each client applica- 
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tion to which it is bound a communications component 30 
and a data-exchange component 32. The communications 
component 30 is a common set of communication facilities 
which implement, for example, subject-based addressing 
5 and/ or service discipline decoupling. The communications 
component is linked to each client application. In addi- 
tion, each communications component is linked to the 
standard transport layer protocols, e.g., TCP/IP, of the 
network to which it is coupled. Each communication 

10 component is linked to and can support multiple transport 
layer protocols. The transport layer of a network does 
the following things: it maps transport layer addresses 
to network addresses, multiplexes transport layer connec- 
tions onto network connections to provide greater 

15 throughput, does error detection and monitoring of service 
quality, error recovery, segmentation and blocking, flow 
control of individual connections of transport layer to 
network and session layers, and expedited data transfer. 
The communications component provides reliable communica- 

20 tions protocols for client applications as well as provid- 
ing location transparency and network independence to the 
client applications. 

The data-exchange component of the communications 
interface, of which component 32 is typical, implements a 

25 powerful way of representing and transmitting data by 

encapsulating the data within self -describing data objects 
called forms. These forms are self -describing in that 
they include not only the data of interest, but also type 
or format information which describes the representations 

30 used for the data and the organization of the form. 

Because the forms include this type or format information, 
format operations to convert a particular form having one 
format to another format can be done using strictly the 
data in the form itself without the need for access to 

35 other data called class descriptors or class definitions 
which give semantic information. The meaning of semantic 
information in class descriptors basically means the names 
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of the fields of the form. 

The ability to perform format operations solely with 
the data in the form itself is very important in that it 
prevents the delays encountered when access must be made 
5 to other data objects located elsewhere, such as class 
descriptors. Since format operations alone typically 
account for 25 to 50% of the processing time for client 
applications, the use of self -describing objects 
streamlines processing by rendering it faster. 

10 The self-describing forms managed by the 

data-exchange component also allow the implementation of 
generic tools for data manipulation and display. Such 
tools include communication tools for sending forms 
between processes in a machine- independent format. 

15 Further, since self -describing forms can be extended, 
i.e., their organization changed or expanded, without 
adversely impacting the client applications using said 
forms, such forms greatly facilitate modular application 
development. 

20 Since the lowest layer of the communications 

interface is linked with the transport layer of the ISO 
model and since the communications component 30 includes 
multiple service disciplines and multiple transport- layer 
protocols to support multiple networks, it is possible to 

25 write application-oriented protocols which transparently 
switch over from one network to another in the event of a 
network failure, 

A "service" represents a meaningful set of functions 
which are exported by an application for use by its client 

30 applications. Examples of services are historical news 
retrieval services such as Dow Jones New, Quotron data 
feed, and a trade ticket router. Applications typically 
export only one service, although the export of many 
different services is also possible. 

35 A "service instance" is an application or process 

capable of providing the given service. For a given 
service, several "instances" may be concurrently providing 
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the service so as to improve the throughput of the service 

or provide fault tolerance. 

Although networks, services and servers are 

traditional components known in the prior art, prior art 
5 distributed systems do not recognize the notion of a 

subject space or data independence by self -describing, 

nested data objects. Subject space supports one form of 

decoupling called subject-based addressing. 

Self -describing data objects which may be nested at 
10 multiple levels are new. Decoupling of client 

applications from the various communications protocols and 

data formats prevalent in other parts of the network is 

also very useful. 

The subject space used to implement subject-based 
15 addressing consists of a hierarchical set of subject 

categories. In the preferred embodiment, a fcur- level 

subject space hierarchy is used. An example of a typical 

subject is: "equity. ibm. composite. trade." The client 

applications coupled to the communications interface have 
20 the freedom and responsibility to establish conventions 

regarding use and interpretations of various subject 

categories • 

Each subject is typically associated with one or more 
services providing data about that subject in data records 

25 stored in the system files. Since each service will have 
associated with it in the communication components of the 
communication interface a service discipline, i.e., the 
communication protocol or procedure necessary to 
communicate with that service, the client applications may 

30 request data regarding a particular subject without 

knowing where the service instances that supply data on 
that subject are located on the network by making 
subscription requests giving only the subject without the 
network address of the service providing information on 

35 that subject. These subscription requests are translated 
by the communications interface into an actual 
communication connection with one or more service 
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instances which provide information on that subject. 

A set of subject categories is referred to as a 
subject domain. Multiple subject domains are allowed. 
Each domain can define domain-specific subject and coding 
functions for efficiently representing subjects in message 
headers . 



DATA IND EPENDENCE: The Data-Exchange Comp onent-. 

The overall purpose of the data-exchange component 
such as component 32 in Figure 1 of the communication 
10 interface is to decouple the client applications such as 
application 16 from the details of data representation, 
data structuring and data semantics. 

Referring to Figure 2, there is shown an example of a 
class definition for a constructed class which defines 
15 both format and semantic information which is common to 
all instances of forms of this class. In the particular 
example chosen, the form class is named Playerjfame and 
has a class ID of 1000. The instances of forms of this 
class 1000 include data regarding the names, ages and NTRP 
20 ratings for tennis players. Every class definition has 
associated with it a class number called the class ID 
which uniquely identifies the class. 

The class definition gives a list of fields by name 
and the data representation of the contents of the field. 
25 Each field contains a form and each form may be either 
primitive or constructed. Primitive class forms store 
actual data, while constructed class forms have fields 
which contain other forms which may be either primitive or 
constructed. In the class definition of Figure 2, there 
30 are four fields named Rating, Age, Last_Name and 

FirstJIame. Each field contains a primitive class form so 
each field in instances of forms of this class will 
contain actual data. For example, the field Rating will 
always contain a primitive form of class 11. Class ll is 
35 a primitive class named Floating_Point which specifies a 
floating-point data representation for the contents of 
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this field. The primitive class definition for the class 
Floating_Point , class 11 , is found in Figure 5. The class 
definition of the primitive class 11 contains the class 
name. Floating^ Point, which uniquely identifies the class 
5 (the class number, class 11 in this example, also uniquely 
identifies the class) and a specification of the data 
representation of the single data value. The 
specification of the single data value uses well-known 
predefined system data types which are understood by both 

10 the host computer and the application dealing with this 
class of forms. 

Typical specifications for data representation of 
actual data values include integer, floating point, ASCII 
character strings or EBCDIC character strings, etc. In 

15 the case of primitive class 11, the specification of the 
data value is Floating_Point_l/l which is an arbitrary 
notation indicating that the data stored in instances of 
forms of this primitive class will be floating-point data 
having two digits total, one of which is to the right of 

20 the decimal point. 

Returning to the consideration of the Player_Name 
class definition of Figure 2, the second field is named 
Age. This field contains forms of the primitive class 
named Integer associated with class number 12 and defined 

25 in Figure 5. The Integer class of form, class 12, has, 
per the class definition of Figure 5, a data 
representation specification of Integer_3, meaning the 
field contains integer data having three digits. The last 
two fields of the class 1000 definition in Figure 2 are 

30 Last_Name and First_Name. Both of these fields contain 
primitive forms of a class named String_Twenty_ASCII, 
class 10. The class 10 class definition is given in 
Figure 5 and specifies that instances of forms of this 
class contain ASCII character strings which are 20 

35 characters long. 

Figure 3 gives another constructed class definition 
named Player_Address , class 1001. Instances of forms of 
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this class each contain three fields named Street, City 
and State, Each of these three fields contains primitive 
forms of the class named String_20_ASCII, class 10. 
Again, the class definition for class 10 is given in 
5 Figure 5 and specifies a data representation of 
20-character ASCII strings. 

An example of the nesting of constructed class forms 
is given in Figure 4. Figure 4 is a class definition for 
instances of forms in the class named Tournament Entry, 
10 class 1002. Each instance of a form in this class 

contains three fields named Tournament_Name, Player, and 
Address. The field Tournament_Name includes forms of the 
primitive class named String_Twenty_ASCIl, class 10 
defined in Figure 5. The field named Player contains 
15 instances of constructed forms of the class named 

Player_Name, class 1000 having the format and semantic 
characteristics given in Figure 2. The field named Ad- 
dress contains instances of the constructed form of 
constructed forms of the constructed class named 
20 Player_Address, class 1001, which has the format and 

semantic characteristics given in the class definition of 
Figure 3. 

The class definition of Figure 4 shows how nesting of 
forms can occur in that each field of a form is a form 

25 itself and every form may be either primitive and have 

only one field or constructed and have several fields. In 
other words , instances of a form may have as many fields 
as necessary, and each field may have as many subfields as 
necessary. Further, each subfield may have as many 

30 sub-subfields as necessary. This nesting goes on for any 
arbitrary number of levels. This data structure allows 
data of arbitrary complexity to be easily represented and 
manipulated. 

Referring to Figure 6 there is shown an instance of a 
35 form of the class of forms named Tournament Entry, class 
1002, as stored as an object in memory. The block of data 
38 contains the constructed class number 1002 indicating 
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that this is an instance of a form of the constructed 
class named Tournament_Entry . The block of data 40 
indicates that this class of form has three fields. Those 
three fields have blocks of data shown at 42, 44, and 46 
5 containing the class numbers of the forms in these fields. 
The block of data at 42 indicates that the first fie" 
contains a form of class 10 as shown in Figure 5. A class 
10 form is a primitive form containing a 20- character 
string of ASCII characters as defined in the class 

10 definition for class 10 in Figure 5. The actual string of 
ASCII characters for this particular instance of this form 
is shown at 48, indicating that this is a tournament entry 
for the U.S. Open tennis tournament. The block of data at 
44 indicates that the second field contains a form which 

15 is an instance of a constructed form of class 1000. 

Reference to this class definition shows that this class 
is named Player_Name. The block of data 50 shows that 
this class of constructed form contains four subfields. 
Those fields contain forms of the classes recorded in the 

20 blocks of data shown at 52, 54, 56 and 58. These fields 
would be subfields of the field 44. The first subfield 
has a block of data at 52, indicating that this subfield 
contains a form of primitive class 11. This class of form 
is defined in Figure 5 as containing a floating-point 

25 two-digit number with one decimal place. The actual data 
for this instance of the form is shown at 60, indicating 
that this player has an NTRP rating of 3.5. The second 
subfield has a block of data at 54, indicating that this 
subfield contains a form of primitive class 12. The class 

30 definition for this class indicates that the class is 
named integer and contains integer data. The class 
definition for class 1000 shown in Figure 2 indicates that 
this integer data, shown at block 62, is the player's age. 
Note that the class definition semantic data regarding 

35 field names is not stored in the form instance. Only the 
format or type information is stored in the form instance 
in the form of the class ID for each field. 
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The third subfield has a block of data at 56, 
indicating that this subfield contains a form of primitive 
class 10 named String_20_ASCII. This subfield corresponds 
to the field Last_Name in the form of class Player_Name, 
5 class 1000, shown in Figure 2. The primitive class 10 
class definition specifies that instances of this 
primitive class contain a 20-character ASCII string. This 
string happens to define the player's last name. In the 
instance shown in Figure 6, the player's last name is 
10 Blackett, as shown at 64. 

The last subfield has a block of data at 58 , 
indicating that the field contains a primitive form of 
primitive class 10 which is a 20-character ASCII string. 
This subfield is defined in the class definition of class 
15 1000 as containing the player's first name. This ASCII 
string is shown at 66. 

The third field in the instance of the form of class 
1002 has a block of data at 46, indicating that this field 
contains a constructed form of the constructed class 1001. 
20 The class definition for this class is given in Figure 3 
and indicates the class is named Player_Address . The 
block of data at 68 indicates that this field has three 
subfields containing forms of the class numbers indicated 
at 70, 72 and 74. These subfields each contain forms of 
25 the primitive class 10 defined in Figure 5. Each of these 
subfields therefore contains a 20-character ASCII string. 
The contents of these three fields are defined in the 
class definition for class 1001 and are, respectively, the 
street, city and state entries for the address of the 
30 player named in the field 44. These 3-character strings 
are shown at 76, 78 and 80, respectively. 

Referring to Figure 7, there is shown a partition of 
the semantic information, format information and actual 
data between the class definition and instances of forms 
35 of this class. The field name and format or type 
information are stored in the class definition, as 
indicated by box 82. The format or type information (in 
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the form of the class ID) and actual data or field values 
are stored in the instance of the form as shown by box 72. 
For example, in the instance of the form of class 
Tournament_Entry, class 1002 shown in Figure 6, the format 
5 data for the first field is the data stored in block 42, 
while the actual data for the first field is the data 
shown at block 48. Essentially, the class number or class 
ID is equated by the communications interface with the 
specification for the type of data in instances of forms 

10 of that primitive class. Thus, the communications inter- 
face can perform format operations on instances of a 
particular form using only the format data stored in the 
instance of the form itself without the need for access to 
the class definition. This speeds up format operations by 

15 eliminating the need for the performance of the steps 
required to access a class definition which may include 
network access and/or disk access, which would 
substantially slow down the operation, since format- type 
operations comprise the bulk of all operations in exchang- 

20 ing data between foreign processes, the data structure and 
the library of programs to handle the data structure 
defined herein greatly increase the efficiency of data 
exchange between foreign processes and foreign computers. 
For example, suppose that the instance of the form 

25 shown in Figure 6 has been generated by a process running 
on a computer by Digital Equipment Corporation (DEC) and 
therefore text is expressed in ASCII characters. Suppose 
also that this form is to be sent to a process running on 
an IBM computer, where character strings are expressed in 

30 EBCDIC code. Suppose also that these two computers were 
coupled by a local area network using the ETHERNET™ 
communications protocol. 

To make this transfer, several format operations 
would have to be performed. These format operations can 

35 best be understood by reference to Figure 1 with the as- 
sumption that the DEC computer is host 1 shown at 10 and 
the IBM computer is host 2 shown at 12. 



WO 92/07324 



PCT/US91/07652 



-25- 

The first format operation to transfer the instance 
of the form shown in Figure 6 from application 16 to 
application 18 would be a conversion from the format shown 
in Figure 6 to a packed format suitable for transfer via 
5 network 14. Networks typically operate on messages 
comprised of blocks of data comprising a plurality of 
bytes packed together end to end preceded by multiple 
bytes of header information which include such things as 
the message length, the destination address, the source 
10 address, and so on, and having error correction code bits 
appended to the end of the message. Sometimes delimiters 
. are used to mark the start and end of the actual data 
block. 

The second format operation which would have to be 

15 performed in this hypothetical transfer would be a 

conversion from the packed format necessary for transfer 
over network 14 to the format used by the application 18 
and the host computer 12. 

Format operations are performed by the forms-manager 

20 modules of the communications interface. For example, the 
first format operation in the hypothetical transfer would 
be performed by the forms-manager module 86 in Figure 1, 
while the second format operation in the hypothetical 
transfer would be performed by the forms-manager module in 

25 the data-exchange component 88. 

Referring to Figure 8, there is shown a flowchart of 
the operations performed by the forms-manager modules in 
performing format operations. Further details regarding 
the various functional capabilities of the routines in the 

30 forms-manager modules of the communications interface will 
be found in the functional specifications for the various 
library routines of the communications interface included 
herein. The process of Figure 8 is implemented by the 
software programs in the forms-manager modules of the 

35 data-exchange components in the communications interface 
according to the teachings of the invention. The first 
step is to receive a format conversion call from either 



WO 52/07324 



PCT/US91/07652 



-26- 

the application or from another module in the 
communications interface. This process is symbolized by 
block 90 and the pathways 92 and 94 in Figure 1. The same 
type call can be made by the application 18 or the 
5 communications component 96 for the host computer 12 in 
Figure 1 to the forms-manager module in the data-exchange 
component 88 , since this is a standard functional 
capability or "tool" provided by the communication inter- 
face of the invention to all client applications. Every 
10 client application will be linked to a communication 
interface like interface 20 in Figure 1. 

Typically, format conversion calls from the com- 
munication components such as modules 30 and 96 in Figure 
1 to the forms-manager module will be from a service 
15 discipline module which is charged with the task of send- 
ing a form in format 1 to a foreign application which uses 
format 2. Another likely scenario for a format conversion 
call from another module in the communication interface is 
when a service discipline has received a form from another 
20 application or service which is in a foreign format and 
which needs to be converted to the format of the client 
application. 

The format conversion call will have parameters 
associated with it which are given to the forms manager. 
25 These parameters specify both the "from" format and the 
"to" or "target" format. 

Block 98 represents the process of accessing an 
appropriate target format-specific table for the specified 
conversion, i.e., the specified "from" format and the 
30 specified "to" format will have a dedicated table that 
gives details regarding the appropriate target format 
class for each primitive "from" format class to accomplish 
the conversion. There are two tables which are accessed 
sequentially during every format conversion operation in 
35 the preferred embodiment. In alternative embodiments, 
these two tables may be combined. Examples of the two 
tables used in the preferred embodiment are shown in 
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Figures 9, 10 and 11. Figure 9 shows a specific format 
conversion table for converting from DEC machines to X.409 
format. Figure 10 shows a format-specific conversion 
table for converting from X.409 format to IBM machine 
5 format. Figure 11 shows a general conversion procedures 
table identifying the name of the conversion program in 
the communications interface library which performs the 
particular conversion for each "from"-"to" format pair. 

The tables of Figures 9 and 10 probably would not be 
10 the only tables necessary for sending a form from the 
application 16 to the application 18 in Figure 1. There 
may be further format-specific tables necessary for 
conversion from application 16 format to DEC machine 
format and for conversion from IBM machine format to ap- 
15 plication 18 format. However, the general concept of the 
format conversion process implemented by the forms-manager 
modules of the communications interface can be explained 
with reference to Figures 9, 10 and 11. 

Assume that the first conversion necessary in the 
20 process of sending a form from application 16 to ap- 
plication 18 is a conversion from DEC machine format to a 
packed format suitable for transmission over an ETHERNET"* 
network. In this case, the format conversion call 
received in step 90 would invoke processing by a software 
25 routine in the forms-manager module which would perform 
the process symbolized by block 98. 

In this hypothetical example, the appropriate 
format-specific table to access by this routine would be 
determined by the "from" format and "to" format parameters 
30 in the original format conversion call received by block 
90. This would cause access to the table shown in Figure 
9. The format conversion call would also identify the 
address of the form to be converted. 

The next step is symbolized by block 100. This step 
35 involves accessing the form identified in the original 
format conversion call and searching through the form to 
find the first field containing a primitive class of form. 
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In other words, the record is searched until a field is 
found storing actual data as opposed to another 
constructed form having subfields. 

In the case of the form shown in Figure 6, the first 
5 field storing a primitive class of form is field 42 . The 
"from" column of the table of Figure 9 would be searched 
using the class number 10 until the appropriate entry was 
found. In this case, the entry for a "from" class of 10 
indicates that the format specified in the class 

10 definition for primitive class 25 is the "to" format. 
This process of looking up the "to" format using the 
"from" format is symbolized by block 102 in Figure 8. The 
table shown in Figure 9 may be "hardwired" into the code 
of the routine which performs the step symbolized by block 

15 102. 

Alternatively, the table of Figure 9 may be a 
database or other file stored somewhere in the network 
file system 24 in Figure l. In such a case, the routine 
performing the step 102 in Figure 8 would know the network 

20 address and file name for the file to access for access to 
the table of Figure 9. 

Next, the process symbolized by block 104 in Figure 8 
is performed by accessing the general conversion 
procedures table shown in Figure 11. This is a table 

25 which identifies the conversion program in the forms 

manager which performs the actual work of converting one 
primitive class of form to another primitive class of 
form. This table is organized with a single entry for 
every "from ,, - n to n format pair. Each entry in the table 

30 for a M from n - l, to w pair includes the name of the conversion 
routine which does the actual work of the conversion. The 
process symbolized by block 104 comprises the steps of 
taking the n from l, -"to n pair determined from access to the 
format-specific conversion table in step 102 and searching 

35 the entries of the general conversion procedures table 
until an entry having a "from"-"to M match is found. In 
this case, the third entry from the top in the table of 
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Pigure 11 matches the w from"-"to" format pair found in the 
access to Figure 9. This entry is read, and it is 
determined that the name of the routine to perform this 
conversion is ASCII_ETHER. (In many embodiments, the 
5 memory address of the routine, opposed to the name, would 
be stored in the table.) 

Block 106 in Figure 8 symbolizes the process of 
calling the conversion program identified by step 104 and 
performing this conversion routine to change the contents 
10 of the field selected in step 100 to the M to n or target 
format identified in step 102. In the hypothetical 
example, the routine ASCII_ETHER would be called and 
performed by step 106. The call to this routine would 
deliver the actual data stored in the field selected in 
15 the process of step 100, i.e., field 42 of the instance of 
a form shown in Figure 6, such that the text string "U.S. 
Open" would be converted to a packed ETHERNET" 1 format. 

Next, the test of block 108 is performed to determine 
if all fields containing primitive classes of forms have 
20 been processed. If they have, then format conversion of 
the form is completed, and the format conversion routine 
is exited as symbolized by block 110. 

If fields containing primitive classes of forms 
remain to be processed, then the process symbolized by 
25 block 112 is performed. This process finds the next field 
containing a primitive class of form. 

Thereafter, the processing steps symbolized by blocks 
102, 104, 106, and 108 are performed until all fields 
containing primitive classes of forms have been converted 
30 to the appropriate w to w format. 

As noted above, the process of searching for fields 
containing primitive classes of forms proceeds serially 
through the form to be converted. If the next field 
encountered contains a form of a constructed class, that 
35 class of form must itself be searched until the first 
field therein with a primitive class of form is located. 
This process continues through all levels of nesting for 
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all fields until all fields have been processed and all 
data stored in the form has been converted to the ap- 
propriate format. As an example of how this works, in the 
form of Figure 6, after processing the first field 42, the 
5 process symbolized by block 112 in Figure 8 would next 
encounter the field 44 (fields will be referred to by the 
block of data that contain the class ID for the form 
stored in that field although the contents of the field 
are both the class ID and the actual data or the fields 

10 and subf ields of the form stored in that field) • Note 

that in the particular class of form represented by Figure 
6, the second field 44 contains a constructed form 
comprised of several subf ields* Processing would then 
access the constructed form of class 1000 which is stored 

15 by the second field and proceeds serially through this 

constructed form until it locates the first field thereof 
which contains a form of a primitive class. In the 
hypothetical example of Figure 6, the first field would be 
the subfield indicated by the class number 11 at 52. The 

20 process symbolized by block 102 would then look up class 
11 in the "from" column in the table of Figure 9 and 
determine that the target format is specified by the class 
definition of primitive class 15. This "from l, - ,, to ,, pair 
11-15 would then be compared to the entries of the table 

25 of Figure 11 to find a matching entry. Thereafter, the 
process of block 106 in Figure 8 would perform the 
conversion program called Floatl_ETHER to convert the 
block of data at 60 in Figure 6 to the appropriate 
ETHERNET™ packed format. The process then would continue 

30 through all levels of nesting. 

Referring to Figure 12, there is shown a flowchart 
for a typical semantic-dependent operation. 
Semantic-dependent operations allow decoupling of applica- 
tions by allowing one application to get the data in a 

35 particular field of an instance of a form generated by a 
foreign application provided that the field name is known 
and the address of the form instance is known. The com- 
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munications interface according to the teachings of the 
invention receives semantic-dependent operation requests 
from client applications in the form of Get_Pield calls in 
the preferred embodiment where all processes use the same 
5 field names for data fields which mean the same thing 
(regardless of the organization of the form or the data 
representation of the field in the form generated by the 
foreign process). In alternative embodiments, an aliasing 
or synonym table or data base is used. In such 
10 embodiments, the Get Field call is used to access the 
synonym table in the class manager and looks for all 
synonyms of the requested field name. All field names 
which are synonyms of the requested field name are 
returned. The class manager then searches the class 
15 definition for a match with either the requested field 

name or any of the synonyms and retrieves the field having 
the matching field name. 

Returning to consideration of the preferred 
embodiment, such Get_Field calls may be made by client ap- 
20 plications directly to the forms-class manager modules 

such as the module 122 in Figure 1, or they may be made to 
the communications components or f orms-manager modules and 
transferred by these modules to the forms-class manager. 
The forms-class manager creates , destroys, manipulates , 
25 stores and reads form-class definitions. 

A Get_Field call delivers to the forms-class manager 
the address of the form involved and the name of the field 
in the form of interest. The process of receiving such a 
request is symbolized by block 120 in Figure 12. Block 
30 120 also symbolizes the process by which the class manager 
is given the class definition either programmatically, 
i.e., by the requesting application, or is told the 
location of a data base where the class definitions 
including the class definition for the form of interest 
35 may be found. There may be several databases or files in 
the network file system 24 of Figure l wherein class 
definitions are stored. It is only necessary to give the 
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forms-class manager the location of the particular file in 
which the class definition for the form of interest is 
stored. 

Next, as symbolized by block 122, the class-manager 
5 module accesses the class definition for the form class 
identified in the original call. 

The class manager then searches the class definition 
field names to find a match for the field name given in 
the original call* This process is symbolized by block 
10 124. 

After locating the field of interest in the class 
definition, the class manager returns a relative address 
pointer to the field of interest in instances of forms of 
this class. This process is symbolized by block 126 in 

15 Figure 12. The relative address pointer returned by the 
class manager is best understood by reference to Figures 
2, 4 and 6. Suppose that the application which made the 
Get_Field call was interested in determining the age of a 
particular player. The Get_Field request would identify 

20 the address for the instance of the form of class 1002 for 
player Blackett as illustrated in Figure 6. Also included 
in the Get_Field request would be the name of the field of 
interest, i.e., "age 11 . The class manager would then 
access the instance of the form of interest and read the 

25 class number identifying the particular class descriptor 
or class definition which applied to this class of forms. 
The class manager would then access the class descriptor 
for class 1002 and find a class definition as shown in 
Figure 4. The class manager would then access the class 

30 definitions for each of the fields of class definition 
1002 and would compare the field name in the original 
Get_Field request to the field names in the various class 
definitions which make up the class definition for class 
1002. In other words, the class manager would compare the 

35 names of the fields in the class definitions for classes 
10, 1000, and 1001 to the field name of interest, "Age". 
A match would be found in the class definition for class 
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1000 as seen from Figure 2. For the particular record 
format shown in Figure 6, the "Age" field would be the 
block of data 62, which is the tenth block of data in from 
the start of the record. The class manager would then 
5 return a relative address pointer of io in block 126 of 
Figure 12. This relative address pointer is returned to 
the client application which made the original Get Field 
call. The client application then issues a Get_Data call 
to the forms-manager module and delivers to the 
10 forms-manager module the relative address of the desired 
field in the particular instance of the form of interest. 
The forms-manager module must also know the address of the 
instance of the form of interest which it will already 
have if the original Get_Field call came through the 
15 forms-manager module and was transferred to the 

forms-class manager. If the forms-manager module does not 
have the address of the particular instance of the form of 
interest, then the forms manager will request it from the 
client application. After receiving the Get_Data call and 
20 obtaining the relative address and the address of the 

instance of the form of interest, the forms manager will 
access this instance of the form and access the requested 
data and return it to the client application. This proc- 
ess of receiving the Get_Data call and returning the ap- 
25 propriate data is symbolized by block 128 in Figure 12. 
Normally, class-manager modules store the class 
definitions needed to do semantic-dependent operations in 
RAM of the host machine as class descriptors. Class 
definitions are the specification of the semantic and 
30 formation information that define a class. Class descrip- 
tors are memory objects which embody the class definition. 
Class descriptors are stored in at least two ways. In 
random access memory (RAM) , class descriptors are stored 
as forms in the format native to the machine and client 
35 application that created the class definition. Class 
descriptors stored on disk or tape are stored as ASCII 
strings of text. 
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When the class-manager module is asked to do a 
semantic-dependent operation, it searches through its 
store of class descriptors in RAM and determines if the 
appropriate class descriptor is present. If it is, this 
5 class descriptor is used to perform the operation detailed 
above with reference to Figure 12. If the appropriate 
class descriptor is not present, the class manager must 
obtain it. This is done by searching through known files 
of class descriptors stored in the system files 24 in 

10 Figure 1 or by making a request to the foreign application 
that created the class definition to send the class 
definition to the requesting module. The locations of the 
files storing class descriptors are known to the client 
applications, and the class-manager modules also store 

15 these addresses. Often, the request for a 

semantic-dependent operation includes the address of the 
file where the appropriate class descriptor may be found. 
If the request does not contain such an address, the class 
manager looks through its own store of class descriptors 

20 and through the files identified in records stored by the 
class manager identifying the locations of system class 
descriptor files. 

If the class manager asks for the class descriptor 
from the foreign application that generated it, the 

25 foreign application sends a request to its class manager 
to send the appropriate class descriptor over the network 
to the requesting class manager or the requesting module. 
The class descriptor is then sent as any other form and 
used by the requesting class manager to do the requested 

30 semantic-dependent operation. 

If the class manager must access a file to obtain a 
class descriptor, it must also convert the packed ASCII 
representation in which the class descriptors are stored 
on disk or tape to the format of a native form for storage 

35 in RAH. This is done by parsing the ASCII text to 

separate out the various field names and specifications of 
the field contents and the class numbers. 
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Figures 13A and 13B illustrate, respectively, a class 
definition and the structure and organization of a class 
descriptor for the class definition of Figure 13A and 
stored in memory as a form. The class definition given in 
5 Figure 13A is named Person_Class and has only two fields, 
named last and first. Each of these fields is specified 
to store a 20-character ASCII string. 

Figure 13B has a data block 140 which contains 1021 
indicating that the form is a constructed form having a 
10 class number 1021. The data block at 142 indicates that 
the form has 3 fields. The first field contains a 
primitive class specified to contain an ASCII string which 
happens to store the class name, Person_Class, in data 
block 146. The second field is of a primitive class as- 
15 signed the number 2, data block 148, which is specified to 
contain a boolean value, data block 150. Semantically, 
the second field is defined in the class definition for 
class 1021 to define whether the form class is primitive 
(true) or constructed (false) . In this case, data block 
20 150 is false indicating that class 1021 is a constructed 
class. The third field is a constructed class given the 
class number 112 as shown by data block 152. The class 
definition for class 1021 defines the third field as a 
constructed class form which gives the names and 
25 specifications of the fields in the class definition. 

Data block 154 indicates that two fields exist in a class 
112 form. The first field of class 112 is itself a 
constructed class given the class number 150, data block 
156, and has two subfields, data block 158. The first 
30 subfield is a primitive class 15, data block 160, which is 
specified in the class definition for class 150 to contain 
the name of the first field in class 1021. Data block 162 
gives the name of the first field in class 1021. The 
second subfield is of primitive class 15, data block 164, 
35 and is specified in the class definition of class 150 (not 
shown) to contain an ASCII string which specifies the 
representation, data block 166, of the actual data stored 



WO 52/07324 



PCT/US91/07652 



-36- 

in the first field of class 1021. The second field of 
class 112 is specified in the class definition of class 
112 to contain a constructed form of class 150 , data block 
168, which has two fields, data block 170, which give the 
5 name of the next field in class 1021 and specify the type 
of representation of the actual data stored in this second 
field. 

DATA DISTRIBUTION AND SERVICE PROTOC OL DECOUPLING BY 
SUBJECT-BASED ADDRESSING AND THE USE OF SERVICE DISCIPLINE 

10 PROTOCOL LAYERS 

Referring to Figure 14, there is shown a block 
diagram of the various software modules, files, networks, 
and computers which cooperate to implement two important 
forms of decoupling. These forms of decoupling are data 

15 distribution decoupling and service protocol decoupling. 
Data distribution decoupling means freeing client applica- 
tions from the necessity to know the network addresses for 
servers providing desired services. Thus, if a particular 
application needs to know information supplied by, for 

20 example, the Dow Jones news service, the client applica- 
tion does not need to know which servers and which loca- 
tions are providing data from the Dow Jones news service 
raw data feed. 

Service protocol decoupling means that the client 

25 applications need not know the particular communications 
protocols used by the servers, services or other 
applications with which exchanges of data are desired. 

Data distribution decoupling is implemented by the 
communications module 30 in Figure 14. The communications 

30 component is comprised of a library of software routines 
which implement a subject mapper 180 and a plurality of 
service disciplines to implement subject-based addressing. 
Service disciplines 182, 184 and 186 are exemplary of the 
service disciplines involved in subject-based addressing. 

35 Subject-based addressing allows services to be 

modified or replaced by alternate services providing 
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consumers. This decoupling of the information consumers 
from information providers permits a higher degree of 
modularization and flexibility than that provided by 
5 traditional service-oriented models. 

Subject-based addressing starts with a subscribe call 
188 to the subject mapper 180 by a client application 16 
running on host computer 10 • The subscribe call is a 
request for information regarding a particular subject. 
10 Suppose hypothetically that the particular subject was 
equity. IBM. news. This subscribe call would pass two 
parameters to the subject mapper 180. One of these 
parameters would be the subject equity • IBM . news . The 
other parameter would be the name of a callback routine in 
15 the client application 16 to which data regarding the 
subject is to be passed. The subscribe call to the 
subject mapper 180 is a standard procedure call. 

The purpose of the subject mapper is to determine the 
network address for services which provide information on 
20 various subjects and to invoke the appropriate service 

discipline routines to establish communications with those 
services. To find the location of the services which 
provide information regarding the subject in the subscribe 
call, the subject mapper 80 sends a request symbolized by 
25 line 190 to a directory-services component 192. The 

directory-services component is a separate process running 
on a computer coupled to the network 14 and in fact may be 
running on a separate computer or on the host computer 10 
itself. The directory-services routine maintains a data 
30 base or table of records called service records which 
indicate which services supply information on which 
subjects, where those services are located, and the 
service disciplines used by those services for 
communication. The directory-services component 192 
35 receives the request passed from the subject mapper 180 
and uses the subject parameter of that request to search 
through its tables for a match. That is, the 
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directory-services component 192 searches through its 
service records until a service record is found indicating 
a particular service or services which provide information 
on the desired subject. This service record is then 
5 passed back to the subject mapper as symbolized by line 
194. The directory-services component may find several 
matches if multiple services supply information regarding 
the desired subject. 

The service record or records passed back to the 

10 subject mapper symbolized by line 194 contain many fields. 
Two required fields in the service records are the name of 
the service which provides information on the desired 
subject and the name of the service discipline used by 
that service. Other optional fields which may be provided 

15 are the name of the server upon which said service is run- 
ning and a location on the network of that server* 

Generally, the directory-services component will 
deliver all the service records for which there is a 
subject map, because there may not be a complete overlap 

20 in the information provided on the subject by all 

services. Further, each service will run on a separate 
server which may or may not be coupled to the client ap- 
plication by the same network. If such multiplicity of 
network paths and services exists, passing all the service 

25 records with subject matter matches back to the subject 
mapper provides the ability for the communications inter- 
face to switch networks or switch servers or services in 
the case of failure of one or more of these items. 

As noted above, the subject mapper 180 functions to 

30 set up communications with all of the services providing 
information on the desired subject. If multiple service 
records are passed back from the directory-services module 
192, then the subject mapper 180 will set up 
communications with all of these services. 

35 Upon receipt of the service records, the subject 

mapper will call each identified service discipline and 
pass to it the subject and the service record applicable 
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to that service discipline. Although only three service 
disciplines 182, 184 and 186 are shown in Figure 14, there 
may be many more than three in an actual system. 

In the event that the directory-services component 
5 192 does not exist or does not find a match, no service 
records will be returned to the subject mapper 180. in 
such a case, the subject mapper will call a default 
service discipline and pass it and the subject and a null 
record . 

10 Each service discipline is a software module which 

contains customized code optimized for communication with 
the particular service associated with that service 
discipline. 

Each service discipline called by the subject mapper 
15 180 examines the service records passed to it and 

determines the location of the service with which com- 
munications are to be established. In the particular 
hypothetical example being considered, assume that only 
one service record is returned by the directory-services 
20 module 192 and that that service record identifies the Dow 
Jones news service running on server 196 and further 
identifies service discipline A at 182 as the appropriate 
service discipline for communications with the Dow Jones 
news service on server 196. Service discipline A will 
25 then pass a request message to server 196 as symbolized by 
line 198. This request message passes the subject to the 
service and may pass all or part of the service record. 

The server 196 processes the request message and 
determines if it can, in fact, supply information regard- 
30 ing the desired subject. It then sends back a reply mes- 
sage symbolized by line 200. 

Once communications are so established, the service 
sends all items of information pertaining to the requested 
subject on a continual basis to the appropriate service 
35 discipline as symbolized by path 202. in the example 

chosen here, the service running on server 196 filters out 
only those news items which pertain to IBM for sending to 
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service discipline at 182. In other embodiments, the 
server may pass along all information it has without 
filtering this information by subject. The communications 
component 30 then filters out only the requested 
5 information and passes it along to the requesting 

application 16. In some embodiments this is done by the 
daemon to be described below, and in other embodiments , it 
is done elsewhere such as in the information or service 
layers to be described below. 

10 Each service discipline can have a different 

behavior. For example, service discipline B at 184 may 
have the following behavior. The service running on 
server 196 may broadcast all news items of the Dow Jones 
news service on the network 14. All instances of service 

15 discipline B may monitor the network and filter out only 
those messages which pertain to the desired subject. Many 
different communication protocols are possible. 

The service discipline A at 182 receives the data 
transmitted by the service and passes it to the named 

20 callback routine 204 in the client application 16. (The 
service discipline 182 was passed the name of the callback 
routine in the initial message from the mapper 180 symbol- 
ized by line 181.) The named callback routine then does 
whatever it is programmed to do with the information 

25 regarding the desired subject. 

Data will continue to flow to the named callback 
routine 204 in this manner until the client application 16 
expressly issues a cancel command to the subject mapper 
180. The subject mapper 180 keeps a record of all 

30 subscriptions in existence and compares the cancel command 
to the various subscriptions which are active. If a match 
is found, the appropriate service discipline is notified 
of the cancel request, and this service discipline then 
sends a cancel message to the appropriate server. The 

35 service then cancels transmission of further data regard- 
ing that subject to the service discipline which sent the 
cancel request. 
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It is also possible for a service discipline to stand 
alone and not be coupled to a subject mapper. In this 
case the service discipline or service disciplines are 
linked directly to the application, and subscribe calls 
5 are made directly to the service discipline. The 

difference is that the application must know the name of 
the service supplying the desired data and the service 
discipline used to access the service. A database or 
directory-services table is then accessed to find the 
10 network address of the identified service, and communica- 
tions are established as defined above. Although this 
software architecture does not provide data distribution 
decoupling, it does provide service protocol decoupling, 
thereby freeing the application from the necessity to know 
15 the details of the communications interface with the 
service with which data is to be exchanged. 

More details on subject-based addressing subscription 
services provided by the communications interface 
according to the teachings of the invention are given in 
20 Section 4 of the communications interface specification 
given below. The preferred embodiment of the 
communications interface of the invention is constructed 
in accordance with that specification. 

An actual subscribe function in the preferred 
25 embodiment is done by performing the TlB_Consume Create 
library routine described in Section 4 of the specifica- 
tion. The call to TIB_Consume_Create includes a property 
list of parameters which are passed to it, one of which is 
the identity of the callback routine specified as 
30 My_Message_Handler in Section 4 of the specification. 

In the specification, the subject-based addressing 
subscription service function is identified as TIBINFO. 
The TIBINFO interface consists of two libraries. The 
first library is called TIBINFO_CONSUME for data 
35 consumers. The second library is called TIBINFO_PUBLISH 
for data providers. An application includes one library 
or the other or both depending on whether it is a consumer 
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or a provider or both. An application can simultaneously 
be a consumer and a provider. 

Referring to Figure 15, there is shown a block 
diagram of the relationship of the communications inter- 
5 face according to the teachings of the invention to the 
applications and the network that couples these applica- 
tions. Blocks having identical reference numerals to 
blocks in Figure 1 provide similar functional capabilities 
as those blocks in Figure 1. The block diagram in Figure 

10 15 shows the process architecture of the preferred 

embodiment. The software architecture corresponding to 
the process architecture given in Figure 15 is shown in 
block form in Figure 16. 

The software architecture and process architecture 

15 detailed in Figures 15 and 16, respectively, represents an 
alternative embodiment to the embodiment described above 
with reference to Figures 1-14. 

Referring to Figure 15, the communications component 
30 of Figure 1 is shown as two separate functional blocks 

20 3 OA and 3 OB in Figure 15. That is, the functions of the 
communications component 30 in Figure 1 are split in the 
process architecture of Figure 15 between two functional 
blocks. A communications library 30A is linked with each 
client application 16, and a backend communications daemon 

25 process 3 0B is linked to the network 14 and to the 
communication library 3 OA. There is typically one 
communication daemon per host processor. This host 
processor is shown at 230 in Figure 15 but is not shown at 
all in Figure 16. Note that in Figure 15, unlike the 

30 situation in Figure 1, the client applications 16 and 18 
are both running on the same host processor 230. Each 
client application is linked to its own copies of the 
various library programs in the communication libraries 
30A and 96 and the form library of the data-exchange 

35 components 32 and 88. These linked libraries of programs 
share a common communication daemon 3 0B. 

The communication daemons on the various host 
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processors cooperate among themselves to insure reliable, 
efficient communication between machines. For subject 
addressed data, the daemons assist in its efficient 
transmission by providing low-level system support for 
5 filtering messages by subject. The communication daemons 
implement various communication protocols described below 
to implement fault tolerance, load balancing and network 
efficiency. 

The communication library 3 OA performs numerous 
10 functions associated with each of the application-oriented 
communication suites. For example, the communication 
library translates subjects into efficient message headers 
that are more compact and easier to check than ASCII 
subject values. The communications library also maps 
15 service requests into requests targeted for particular 
service instances, and monitors the status of those 
instances . 

The data-exchange component 32 of the communications 
interface according to the teachings of the invention is 

20 implemented as a library called the w form library." This 
library is linked with the client application and provides 
all the core functions of the data-exchange component. 
The form library can be linked independently of the 
communication library and does not require the 

25 communication daemon 3 OB for it 6 operation. 

The communication daemon serves in two roles. In the 
subject-based addressing mode described above where the 
service instance has been notified of the subject and the 
network address to which data is to be sent pertaining to 

30 this subject, the communication daemon 3 OB owns the 

network address to which the data is sent. This data is 
then passed by the daemon to the communication library 
bound to the client application, which in turn passes the 
data to the appropriate callback routine in the client 

35 application. In another mode, the communication daemon 
filters data coming in from the network 14 by subject when 
the service instances providing data are in a broadcast 
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mode and are sending out data regarding many different 
subjects to all daemons on the network. 

The blocks 231, 233 and 235 in Figure 15 represent 
the interface functions which are implemented by the 
5 programs in the communication library 3 OA and the form 
library 32. The TIBINFO interface 233 provides 
subject-based addressing services by tht communication 
paradigm known as the subscription call. In this 
paradigm, a data consumer subscribes to a service or 

10 subject and in return receives a oontira us stream of data 
about the service or subject until the consumer explicitly 
terminates the subscription (or a failure occurs) . A 
subscription paradigm is well suited to real-time 
applications that monitor dynamically changing values, 

15 such as a stock price. In contrast, the more traditional 
request/reply communication is ill suited to such 
real-time applications, since it requires data consumers 
to "poll" data providers to learn of changes. 

The interface 235 defines a programmatic interface to 

20 the protocol suite and service comprising the Market Data 
Subscription Service (MDSS) sub-component 234 in Figure 
16. This service discipline will be described more fully 
later. The RMDP interface 235 is a service address 
protocol in that it requires the client application to 

25 know the name of the service with which data is to be 
exchanged. 

In Figure 16 there is shown the software architecture 
of the system. A distributed communications component 232 
includes various protocol engines 237, 239 and 241. a 

30 protocol engine encapsulates a communication protocol 
which interfaces service discipline protocols to the 
particular network protocols. Each protocol engine 
encapsulates all the logic necessary to establish a highly 
reliable, highly efficient communication connection. Each 

J5 protocol engine is tuned to specific network properties 
and specific applications properties. The protocol 
engines 237, 239 and 241 provide a generic communication 
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interf ace to the client applications such as applications 
16 and 18. This frees these applications (and the 
programmers that write them) from the need to know the 
specific network or transport layer protocols needed to 
5 communicate over a particular network conf iguration. 

Further, if he network configuration or any of the network 
protocols are changed such as by addition of a new local 
area network, gateway etc. or switching of transport layer 
protocols say from DECNET™ to TCP/IP™, the application 
10 programs need not be changed. Such changes can be 

accommodated by the addition, substitution or alteration 
. of the protocol engines so as to accommodate the change. 
Since these protocol engines are shared, there is less 
effort needed to change the protocol engines than to 
15 change all the applications. 

The protocol engines provide protocol transparency 
and communication path transparency to the applications 
thereby freeing these applications from the need to have 
code which deals with all these details. Further, these 
20 protocol engines provide network interface transparency. 

The protocol engines can also provide value added 
services in some embodiments by implementing reliable 
communication protocols. Such value added services 
include reliable broadest and reliable point to point 
25 communications as well as Reliable Multicast™ 

communications where communications are switched from 
reliable broadcast to reliable point to point when the 
situation requires this change for efficiency. Further, 
the protocol engines enhance broadcast operations where 
30 two or more applications are requesting data on a subject 
by receiving data directed to the first requesting 
application and passing it along to the other requesting 
applications. Prior art broadcast software does not have 
this capability. 
35 The protocol engines also support efficient subject 

based addressing by filtering messages received on the 
network by subject. In this way, only data on the 
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reguested subject gets passed to the callback routine in 
the requesting application. In the preferred embodiment, 
the protocol engines coupled to the producer applications 
or service instances filter the data by subject before it 
5 is placed in the network thereby conserving network 

bandwidth, input/ output processor bandwidth and overhead 
processing at the receiving ends of communication links. 

The distributed communication component 232 
(hereafter DCC) in Figure 16 is structured to meet several 

10 important objectives. First, the DCC provides a simple, 
stable and uniform communication model. This provides 
several benefits. It shields programmers from the 
complexities of: the distributed environment; locating a 
target process; establishing communications with this 

15 target process; and determining when something has gone 
awry. All these tasks are best done by capable 
communications infrastructure and not by the programmer. 
Second, the DCC reduces development time not only by 
increasing programmer productivity but also by simplifying 

20 the integration of new features. Finally, it enhances 

configurability by eliminating the burden on applications 
to know the physical distribution on the network of other 
components. This prevents programmers from building 
dependencies in their code on particular physical 

25 configurations which would complicate later 
reconfigurations . 

Another important objective is the achievement of 
portability through encapsulation of important system 
structures. This is important when migrating to a new 

30 hardware or software environment because the client 
applications are insulated from transport and access 
protocols that may be changing. By isolating the required 
changes in a small portion of the system (the DCC) , the 
applications can be ported virtually unchanged and the 

35 investment in the application software is protected. 

Efficiency is achieved by the DCC because it is coded 
on top of less costly "connectionless" transport protocol 
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in standard protocol suites such as TCP/IP and OSI. The 
DCC is designed to avoid the most costly problem in 
protocols, i.e., the proliferation of data "copy" 
operations. 

5 The DCC achieves these objectives by implementing a 

layer of services on top of the basic services provided by 
vendor supplied software. Rather than re-inventing basic 
functions like reliable data transfer or flow-control 
mechanisms, the DCC shields applications from the 
10 idiosyncracies of any particular operating system. 

Examples include the hardware oriented interfaces of the 
MS-DOS environment, or the per-process file descriptor 
limit of UNIX. By providing a single unified 
communication toll that can be easily replicated in many 
15 hardware and software environments, the DCC fulfills the 
above objectives. 

The DCC implements several different transmission 
protocols to support the various interaction paradigms , 
fault-tolerance requirements and performance requirements 
20 imposed by the service discipline protocols. Two of the 
more interesting protocols are the reliable broadcast and 
intelligent multicast protocols. 

Standard broadcast protocols are not reliable and are 
unable to detect lost messages. The DCC reliable 
25 broadcast protocols ensure that all operational hosts 

either receive each broadcast message or detects the loss 
of the message. Unlike many so-called reliable broadcast 
protocols, lost messages are retransmitted on a limited, 
periodic basis. 
30 The Intelligent Multicast™ protocol provides a 

reliable datastream to multiple destinations. The novel 
aspect of this protocol is that it can switch dynamically 
from point-to-point transmission to broadcast transmission 
in order to optimize the network and processor load. The 
35 switch from point-to-point to broadcast (and vice-versa) 
is transparent to higher-level protocols. This transport 
protocol allows the support of a much larger number of 
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consumers than would be possible using either point-to- 
point or broadcast alone. The protocol is built on top of 
other protocols with the DCC. 

Currently, all DCC protocols exchange data only in 
5 discrete units, i.e., messages (in contrast to many 

transport protocols. The DCC guarantees that the messages 
originating from a single process are received in the 
order sent. 

The DCC contains fault tolerant message transmission 

10 protocols that support retransmission in the event of a 
lost message. The DCC software guarantees "at-most-once" 
semantics with regard to message delivery and makes a best 
attempt to ensure " exact ly-once" semantics. 

The DCC has no exposed interface for use by 

15 application programmers. 

The distributed component 232 is coupled to a variety 
of service disciplines 234, 236 and 238. The service 
discipline 234 has the behavior which will herein be 
called Market Data Subscription Service. This protocol 

20 allows data consumers to receive a continuous stream of 
data, fault tolerant of failures of individual data 
sources. This protocol suite provides mechanisms for 
administering load-balancing and entitlement policies. 
The MDSS service discipline is service oriented in 

25 that applications calling this service discipline through 
the PMDP interface must know the service that supplies 
requested data. The MDSS service discipline does however 
support the subscription communication paradigm which is 
implemented by the Subject Addressed Subscription Service 

30 (SASS) service discipline 238 in the sense that streams of 
data on a subject will be passed by the MDSS service 
discipline to the linked application. 

The MDSS service discipline allows data consumer 
applications to receive a continuous stream of data, 

35 tolerant of failures of individual data sources. This 
protocol suite 234 also provides mechanisms for load 
balancing and entitlement policy administration where the 
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access privileges of a user or application are checked to 
insure a data consumer has a right to obtain data from a 
particular service. 

Two properties distinguish the MDSS service 
5 discipline from typical client server protocols. First, 
subscriptions are explicitly supported whereby changes to 
requested values are automatically propagated to 
requesting applications. Second client applications 
request or subscribe to a specific service (as opposed to 
10 a particular server and as opposed to a particular 

subject) . The MDSS service discipline then forwards the 
client application request to an available server. The 
MDSS service discipline also monitors the server 
connection and reestablishes it if the connection fails 
15 using a different server if necessary. 

The MDSS service discipline implements the following 
important objectives. 

Fault tolerance is implemented by program code which 
performs automatic switchover between redundant services 
20 by supporting dual or triple networks and by utilizing the 
fault tolerant transmission protocols such as reliable 
broadcast implemented in the protocol engines. Recovery 
is automatic after a server failure. 

Load balancing is performed by balance the data 
25 request load across all operating servers for a particular 
service. The load is automatically rebalanced when a 
server fails or recovers. In addition, the MDSS supports 
server assignment policies that attempts to optimize the 
utilization of scarce resources such as "slots" in a page 
30 cache or bandwidth across an external communication line. 

Network efficiency is implemented by an intelligent 
multicast protocol implemented by the distributed 
communication daemon 3 OB in Figure 15. The intelligent 
multicast protocol optimizes limited resources of network 
35 and I/O processor bandwidth by performing automatic, 
dynamic switchover from point to point communication 
protocols to broadcast protocols when necessary. For 
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example, Telerate page 8 data may be provided by point to 
point distribution to the first five subscribers and then 
switch all subscribers to broadcast distribution when the 
sixth subscriber appears. 
5 The MDSS service discipline provides a simple, easy- 

to-use application development interface that masks most 
of the complexity of programming a distributed system, 
including locating servers, establishing communication 
connections, reacting to failures and recoveries and load 

10 balancing. 

The core functions of the MDSS service discipline 
. are: get, halt and derive. The "get" call from a client 
application establishes a fault-tolerant connection to a 
server for the specified service and gets the current 

15 value of the specified page or data element. The 

connection is subscription based so that updates to the 
specified page are automatically forwarded to the client 
application. "Halt" stops the subscription. "Drive 11 
sends a modifier to the service that can potentially 

20 change the subscription. 

The MDSS service discipline is optimized to support 
page oriented services but it can support distribution of 
any type data. 

The service discipline labeled MSA, 236, has yet a 

25 different behavior. The service discipline labeled SASS, 
238, supports subject-based address subscription services. 

The basic idea behind subject based addressing and 
the SASS service discipline's (hereafter SASS) 
implementation of it is straightforward. Whenever an 

30 application requires data, especially data on a 
dynamically changing value, the application simply 
subscribes to it by specifying the appropriate subject. 
The SASS then maps this subject request to one or more 
service instances providing information on this subject. 

35 The SASS then makes the appropriate communication 
connections to all the selected services through the 
appropriate one or more protocol engines necessary to 
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communication with the servicer or servers providing the 
selected service or services. 

Through the use of subject based addressing, 
information consumers can request information in a way 
5 that is independent of the application producing the 
information. Hence, the producing application can be 
modified or supplanted by a new application providing the 
same information without affecting the consumers of the 
information. 

10 Subject based addressing greatly reduces the 

complexities of programming a distributed application in 
three ways. First, the application requests information 
by subject, as opposed to by server or service. 
Specifying information at this high level removes the 
15 burden on applications of needing to know the current 
network address of the service instances providing the 
desired information. It further relieves the application 
of the burden or knowing all the details of the 
communication protocols to extract data from the 
20 appropriate service or services and the need to know the 
details of the transport protocols needed to traverse the 
network. Further, it insulates the client applications 
from the need for programming changes when something else 
changes like changes in the service providers, e.g., a 
25 change from IDN to Ticker 3 for equity prices. All data 
is provided through a single, uniform interface to client 
applications. A programmer writing a client application 
needing information from three different services need not 
learn three different service specific communication 
30 protocols as he or she would in traditional communication 
models. Finally, the SASS automates many of the difficult 
and error prone tasks such as searching for an appropriate 
service instance and establishing a correct communication 
connection. 

35 The SASS service discipline provides three basic 

functions which may be invoked through the user interface. 
"Subscribe" is the function invoked when the consumer 
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requests information on a real-time basis on one or more 
subjects. The SASS service discipline sets up any 
necessary communication connections to ensure that all 
data matching the given subject (s) will be delivered to 
5 the consumer application. The consumer can specify that 
data be delivered either asynchronously (interrupt-driven) 
or synchronously. 

The producer service will be notified of the 
subscription if a registration procedure for its service 

10 has been set up. This registration process will be done 
by the SASS and is invisible to the user. 

The "cancel" function is the opposite of "subscribe". 
When this function is invoked, the SASS closes down any 
dedicated communication channel and notifies the producer 

15 service of the cancellation if a registration procedure 
exists. 

The "Receive" function and "callback" function are 
related functions by which applications receive messages 
matching their subscriptions. Callbacks are asynchronous 

20 and support the event driven programming style. This 

style is well suited for applications requiring real time 
data exchange. The receive function supports a 
traditional synchronous interface for message receipt. 
A complementary set of functions exists for a data 

25 producer. Also, applications can be both data producers 
and data consumers. 

Referring to Figure 17 there is shown a typical 
computer network situation in which the teachings of the 
invention may be profitably employed. The computer 

30 network shown is comprised of a first host CPU 300 in 
Houston coupled by a local area network (hereafter LAN) 
302 to a file server 304 and a gateway network 
interconnect circuit 306. The gateway circuit 306 
connects the LAN 302 to a wide area network (hereafter 

35 WAN) 308. The WAN 308 couples the host 300 to two servers 
310 and 312 providing the Quotron and Marketfeed 2000 
services, respectively, from London and Paris, 
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respectively. The WAN 308 also couples the host 300 to a 
second host CPU 314 in Geneva and a server 316 in Geneva 
providing the Telerate service via a second LAN 318* Dumb 
terminal 320 is also coupled to LAN 318. 
5 Typically the hosts 300 and 314 will be multitasking 

machines, but they may also be single process CPU's such 
as computers running the DOS or PC-DOS operating systems 
The TIB communication interface software supplied herewith 
as Appendix A embodies the best mode of practicing the 

10 invention and is ported for a Unix based multitasking 
machine. To adapt the teachings of the invention to the 
DOS or other single task environments requires that the 
TIB communication daemon 3 0B in the process architecture 
be structures as an interrupt driven process which is 

15 invoked, i.e., started upon receipt of a notification from 
the operating system that a message has been received on 
the network which is on a subject to which one of the 
applications has subscribed. 

The LAN's 302 and 318, WAN 308 and gateway 306 may 

20 each be of any conventional structure and protocol or any 
new structure and protocol developed in the future so long 
as they are sufficiently compatible to allow data exchange 
among the remaining elements of the system. Typically, 
the structures and protocols used on the networks will be 

25 TCP/IP, DECNET™ 4 , ETHERNET**, token ring, ARPANET and/or 
other digital pack or high speed private line digital or 
analog systems using hardwire, microwave or satellite 
transmission media. Various CCITT recommendations such as 
X.l, X.2, X.3, X.20, X.21, X.24, X.28, X.29, X.25 and X.75 

30 suggest speeds, user options, various interface standards, 
start-stop mode terminal handling, multiplex interface for 
synchronous terminals, definitions of interface circuits 
and packet-network Interconnection, all of which are 
hereby incorporated by reference. A thorough discussion 

35 of computer network architecture and protocols is included 
in a special issue of IEEE Transactions on Communications, 
April 1980, Vol. COM-28, which also is incorporated herein 
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by reference. Most digital data communication is done by 
characters represented as sequences of bits with the 
number of bits per character and the sequence of 0's and 
l's that correspond to each character defining a code. 
5 The most common code is International Alphabet No, 5 which 
is known in the U.S. as ASCII. Other codes may also be 
used as the type of code used is not critical to the 
invention. 

In coded transmission, two methods of maintaining 

10 synchronism between the transmitting and receiving points 
are commonly used. In "start-stop" transmission, the 
interval between characters is represented by a steady 1 
signal, and the transmission of a single 0 bit signals the 
receiving terminal that a character is starting. The data 

15 bits follow the start bit and are followed by a stop 

pulse. The stop pulse is the came as the signal between 
characters and has a minimum length that is part of the 
terminal specification. In the synchronous method, bits 
are sent at a uniform rate with a synchronous idle pattern 

20 during intervals when no characters are being sent to 
maintain timing. The synchronous method is used for 
higher speed transmission. 

Protocols as that term is used in digital computer 
network communication are standard procedures for the 

25 operation of communication. Their purpose is to 

coordinate the equipment and processes at interfaces at 
the ends of the communication channel. Protocols are 
considered to apply to several levels. The International 
Organization for Standardization (ISO) has developed a 

30 seven level Reference Model of Open System Interconnection 
to guide the development of standard protocols. The seven 
levels of this standard hereafter referred to as the ISO 
Model and their functions are: 

(1) Application: Permits communication between 
35 applications. Protocols here 

serve the needs of the end user. 

(2) Presentation: Presents structured data in 

proper form for use by 
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(3) Session; 



(4) Transport: 



(5) Network: 



20 (6) Data Link: 



application programs. Provides a 
set of services which may be 
selected by the application layer 
to enable it to interpret the 
meaning of data exchanged. 

Sets up and takes down 
relationships between 
presentation entities and 
controls data exchange, i.e., 
10 dialog control. 

Furnishes network- independent 
transparent transfer of data. 
Relieves the session layer from 
any concern with the detailed way 
15. in which reliable and cost- 

effective transfer of data is 
achieved. 

Provides network independent 
routing, switching services. 

Gives error-free transfer of data 
over a link by providing 
functional and procedural means 
to establish, maintain and 
release data links between 
25 network entities. 

Provides mechanical, electrical, 
functional and procedural 
characteristics to establish, 
maintain, and release physical 
30 connections, e.g., data circuits 

between data link entities. 

Some data link protocols, historically the most 
common, use characters or combinations of characters to 
control the interchange of data. Others, including the 

35 ANSI Advanced Data Communication Control Procedure and its 
subsets use sequences of bits in predetermined locations 
in the message to provide the link control* 

Packet networks were developed to make more efficient 
use of network facilities than was common in the circuit- 

40 switched and message-switched data networks of the 

mid-60 's. In circuit-switched networks, a channel was 
assigned full time for the duration of a call. In 
message-switched networks, a message or section of a 
serial message was transmitted to the next switch if a 



(7) Physical: 
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path (loop or trunk) was available. If not, the message 
was stored until a path was available. The use of trunks 
between message switches was often very efficient. In 
many circuit-switched applications though, data was 
5 transmitted only a fraction of the time the circuit was in 
u?e. In order to make more efficient use of facilities 
and for other reasons, packet networks came into 
existence. 

In a packet network, a message from one host or 

10 terminal to another is divided into packets of some 
definite length, usually 128 bytes. These packets are 
then sent from the origination point to the destination 
point individually. Each packet contains a header which 
provides the network with the necessary information to 

15 handle the packet. Typically, the packet includes at 

least the network addresses of the source and destination 
and may include other fields of data such as the packet 
length, etc. The packets transmitted by one terminal to 
another are interleaved on the facilities between the 

20 packets transmitted by other users to their destinations 
so that the idle time of one source can be used by another 
source. Various network contention resolution protocols 
exist to arbitrate for control of the network by two or 
more destinations wishing to send packets on the same 

25 channel at the same time. Some protocols utilize multiple 
physical channels by time division or frequency 
multiplexing . 

The same physical interface circuit can be used 
simultaneously with more than one other terminal or 

30 computer by the use of logical channels. At any given 
time, each logical channel is used for communication with 
some particular addressee; each packet includes in its 
header the identification of its logical channel, and the 
packets of the various logical channels are interleaved on 

35 the physical- interface circuit. 

At the destination, the message is reassembled and 
formatted before delivery to the addressee process. In 
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general, a network has an internal protocol to control the 
movement of data within the network. 

The internal speed of the network is generally higher 
than the speed of any terminal or node connected to the 
5 network. 

Three methods of handling messages are in common use. 
"Datagrams" are one-way messages sent from an originator 
to a destination. Datagram packets are delivered 
independently and not necessarily in the order sent. 

10 Delivery and nondelivery notifications may be provided. 
In "virtual calls", packets are exchanged between two 
users of the network; at the destination, the packets are 
delivered to the addressee process in the same order in 
which they were originated. "Permanent virtual circuits" 

15 also provide for exchange of packets between two users on 
a network. Each assigns a logical channel, by arrangement 
with the provider of the network, for exchange of packet 
with the other. No setup or clearing of the channel is 
then necessary. 

20 Some packet networks support terminals that do not 

have the internal capability to format messages in packets 
by means of a packet assembler and disassembler included 
in the network. 

The earliest major packet network in the U.S. was 

25 ARPNET, set up to connect terminals and host computers at 
a number of universities and government research 
establishments. The objective was to permit computer 
users at one location to use data or programs located 
elsewhere, perhaps in a computer of a different 

30 manufacturer. Access to the network is through an 
interface message processor (IMP) at each location, 
connected to the host computer (s) there and the IMP at 
other locations. IMP'S are not directly connected to each 
other IMP. Packets routed to destination IMP'S not 

35 connected directly to the source IMP are routed through 
intervening IMP'S until they arrive at the destination 
process. At locations where there is no host, teip&inal 
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interface processors are used to provide access for dumb 
terminals. 

Other packet networks have subsequently been set up 
worldwide, generally operating in the virtual call mode. 
5 In early packet networks, routing of each packet in a 

message is independent. Each packet carries in its header 
the network address of the destination as well as a 
sequence number to permit arranging of the packets in the 
proper order at the destination. Networks designed more 
10 recently use a "virtual circuit" structure and protocol. 
The virtual circuit is set up at the beginning of a data 
transmission and contains the routing information for all 
the packets of that data transmission. The packets after 
the first contain the designation of the virtual circuit 
15 in their headers. In some networks, the choice of route 
is based on measurements received from all other nodes, of 
the delay to every other node on the network. In still 
other network structures, nodes on the network are 
connected to some or all the other nodes by doubly 
20 redundant or triply redundant pathways. 

Some networks such as Dialog f Tymshare and Telenet 
use the public phone system for interconnection and make 
use of analog transmission channels and modems to modulate 
digital data onto the analog signal lines. 

Other network structures, generally WAN's, use 
microwave and/or satellites coupled with earth stations 
for long distance transmissions and local area networks or 
the public phone system for local distribution. 

Obviously, there is a wide variety of network 
structures and protocols in use. Further, new designs for 
network and transport protocols, network interface cards, 
network structures, host computers and terminals, server 
protocols and transport and network layer software are 
constantly appearing. This means that the one thing that 
is constant in network design and operation is that it is 
constantly changing. Further, the network addresses where 
specific types of data may be obtained and the access 
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protocols for obtaining this data are constantly changing. 
It is an object of the communication interface software of 
the invention to insulate the programmer of application 
programs from the need to know all the networks and 
5 transport protocols, network addresses, access protocols 
and services through which data on a particular subject 
may be obtained. By encapsulating and modularizing all 
this changing complexity in the interface software of the 
invention, the investment in application programs may be 
10 protected by preventing network topology or protocol 

dependencies from being programmed into the applications. 
Thus, when something changes on the network, it is not 
necessary to reprogram or scrap all the application 
programs. 

15 The objectives are achieved according to the 

teachings of the invention by network communications 
software having a three-layer architecture, hereafter 
sometimes called the TIB*" software. In Figure 17, these 
three layers are identified as the information layer, the 

20 service layer and the distributed communication layer. 
Each application program is linked during the compiling 
and linking process to its own copy of the information 
layer and the service layer. The compiling and linking 
process is what converts the source code of the 

25 application program to the machine readable object code. 
Thus, for example, application program 1, 340, is directly 
linked to its own copy of layers of the software of the 
invention, i.e., the information layer 342 and the service 
layer 344. Likewise application 2, 346 is linked to its 

30 own copies of the information layer 348 and the service 
layer 350. These two applications share the third layer 
of the software of the invention called the distributed 
communication layer 352. Typically there is only one 
distributed communication layer per node (where a node is 

35 any computer, terminal or server coupled to the network) 
which runs concurrently with the applications in 
multitasking machines but which could be interrupt drivers 
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in nonmultitasking environments. 

The second host 314 in Geneva in the hypothetical 
network of Figure 17 is running application program 3, 
354. This application is linked to its copies of the 
5 information layer 356 and the service layer 358, A 

concurrently running distributed communication layer 360 
in host 2 is used by application 354. 

Each of the servers 310, 312 and 316 have a data 
producer versions of the 3 layer TIB™ software. There is 

10 a data consumer version of the TIB™ software which 

implements the "subscribe' function and a data producer 
version which implements the "publish" function. Where a 
process (a program in execution under the UNIX™ 1 
definition) is both a data consumer and a data publisher, 

15 it will have libraries of programs and interface 

specifications for its TIB™ software which implement both 
the subscribe and publish functions. 

Each of the hosts 300 and 314 is under the control of 
an operating system, 370 and 372, respectively, which may 

20 be different. Host 1 and host 2 may also be computers of 
different manufacturers as may servers 310, 312 and 316. 
Host 1 has on-board shared memory 374 by which 
applications 340 and 346 may communicate such as by use of 
a UNIX™ pipe or other interprocess communication 

25 mechanism. Host 2 utilizes memory 378. 

In a broad statement of the teachings of the 
invention, the information layer, such as 342, 
encapsulates the TIBINFO™ interface functionality, and the 
subject-based addressing functionality of the TIB™ 

30 software communication library 30A in Figure 15, of 

Figures 15 and 16. The TIBINFO interface is defined in 
Section 4 of the software specification below. TIBINFO 
defines a programmatic interface by which applications 
linked to this information layer may invoke the protocols 

35 and services of the Subject -Addressed Subscription Service 
( SASS ) component . 

Figure 18 clarifies the relationships between the 
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process architecture of Figure 15, the software 
architecture of Figure 16 and the 3 layers for the TIB™ 
software shown in Figure 17. In Figure 15, the 
communications library 3 OA is a library of programs which 
5 are linked to the application 16 which provide multiple 
functions which may be called upon by the RMDP and TIBINFO 
interfaces. Subject-Addressed Subscription Services are 
provided by subject mapper programs and service discipline 
programs in the component labeled 3 OA/ 3 OB. This component 
10 3 OA/ 3 OB also includes library programs that provide the 
common infrastructure program code which supports, i.e., 
communicates with and provides data to, the protocol 
engine programs of the TIB communication daemon 30B. 
The TIBINFO interface is devoted to providing a 
15 programmatic interface by which linked client applications 
may start and use subject-addressed subscriptions for data 
provided by data producers on the network wherever they 
may be. 

The RMDP interface provide the programmatic interface 

20 by which subscriptions may be entered and data received 
from services on the network by linked client applications 
which already know the names of the services which supply 
this data. The communication library 3 OA in Figure 15 
supplies library programs which may be called by linked 

25 client applications to implement the Market-Data- 
Subscription Service (MDSS) . This function creates 
subscription data requested by service names by setting 
up, with the aid of the daemon's appropriate protocol 
engine, a reliable communication channel to one or more 

30 servers which supply the requested data. Failures of 

individual servers are transparent to the client since the 
MDSS automatically switches to a new server which supplies 
the same services using the appropriate protocol engine. 
The MDSS also automatically balances the load on servers 

35 and implements entitlement functions to control who gets 
access to a service. The reliable communication protocols 
of the DCC library 30A/30B such as intelligent multicast 
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and reliable broadcast and the protocol engines of the 
daemon 3 OB are invoked by the MDSS library programs. More 
details are given in Section 3 of the TIB™ Specification 
given below. 

5 Referring to Figure 19, which is comprised of Figures 

19A and 19B, there is shown a flow chart for the process 
carried out, inter alia, at each of the 3 layers on the 
subscriber process side for entering a subscription on a 
particular subject and receiving data on the subject. 

10 Step 400 represents the process of receiving a request 
from a user for data on a particular subject. This 
request could come from another process, another machine 
or from the operating system in some embodiments. For 
purposes of this example assume that the request comes to 

15 application 1 in Figure 17 from a user. 

Application 1 (on the application layer or layer 1 of 
the ISO Model) then sends a "subscribe request" to 
information layer 342 in Figure 17. This process is 
represented by step 402 in Figure 19* This subscribe 

20 request is entered by calling the appropriate library 
program in the linked library of programs which includes 
the TIB-INFO interface. This subroutine call passes the 
subject on which data is requested and a pointer to the 
callback routine in the requesting process that the TIB- 

25 INFO library program on the information layer is to call 
when messages are received on this subject. 

The information layer 342 encapsulates a subject-to- 
service discipline mapping function which provides 
architectural decoupling of the requesting process as that 

30 term is defined in the glossary herein. Referring to 

steps 404 and 406 in Figure 19 and to Figure 17, the input 
to the information layer is the subject and the output is 
a call to a service discipline on the service layer 344 in 
Figure 17. The information layer includes the TIB-INFO 

35 interface and all library programs of the linked 

communications library 3 OA in Figure 15 involved with 
subject-to-service mapping. The information layer maps 
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the subject to the service or services which provide data 
on this subject as symbolized by step 404 and then maps 
this service or services to one or more service 
disciplines that encapsulate communication protocols to 
5 communicate with these services. This information layer 
then coordinates with the service discipline to assign a 
"TIB channel 11 as symbolized by step 410. 

A "TIB channel" is like an "attention: Frank Jones" 
line on the address of a letter. This TIB channel data is 

10 used to route the message to the process which requested 
data on whatever subject is assigned to that TIB channel. 
Each subject is assigned a TIB channel when a subscription 
is entered. There is a subscription list that correlates 
the subscribing processes, their network addresses, the 

15 subjects subscribed to and the TIB channel numbers 

assigned to these subjects. Data on this list is used by 
the daemon to route messages received at its port address 
to the proper requesting process. This list is also used 
on the data publisher side to cause messages on particular 

20 subjects to be routed to the port address of the machine 
on which the requesting process is running. The 
communication layer of the TIB software associated with 
the service writes the channel number data in the headers 
of packets from messages on particular subjects before 

25 these packets are transmitted on the network. At the 
receiver side, the TIB channel data in the header causes 
proper routing of the packet to the requesting process. 
The TIB channel abstraction and the routing function it 
implies is performed by the DCC library portion 30A/30B in 

30 Figure 18 which is linked to each requesting process. 

Assuming there are two such services, these services 
are then mapped by the service disciplines on the service 
layer to the servers that provide these services as 
symbolized by step 412. 

35 In one embodiment, the information layer selects and 

calls only one of the service discipline subroutines in 
the service layer as symbolized by step 406. The service 
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discipline then runs and assigns a TIB channel to the 
subscription subject as symbolized by step 408. The call 
from the information layer also passes the pointer to a 
callback routine in the information layer to be called 
5 when messages on the subject arrive. 

In alternative embodiments, the information layer may 
call all the service disciplines identified in the 
subject-to-service discipline mapping process so as to set 
up communication links with all the services. 
10 In some embodiments, the names of alternative 

services and alternative servers are passed to the 
selected service discipline or directly to the distributed 
communication layer by the information layer for use in 
setting up alternate communication links. This allows the 
15 distributed communication layer to set up an alternate 

communication link to another server in case of failure of 
the selected server or for simultaneous communication link 
setup to increase the throughput of the network. In still 
other embodiments, the requesting process can call the 
20 service layer directly and invoke the appropriate service 
discipline by doing the subject-to-service discipline 
mapping in the application itself. The data regarding 
alternate services and servers can be passed by calling a 
library subroutine in the DCC library of block 3 OA/ 3 OB in 
Figure 18 which runs and stores the data regarding the 
alternates. 

In alternative embodiments, the information layer may 
assign the TIB channel to each subject or the service 
layer may assign the TIB channel acting alone without 
coordinating with the information layer. Step 410 
represents the embodiments where the service discipline 
assigns the TIB channel number by coordinating with the 
information layer. Messages sent by data provider 
processes will have the assigned subject channel data 
included as part of their header information. 

TIB channels are used by the communication layer 
(DCC) for filtering and routing purposes. That is, the 
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daemon 3 OB in Figure 15 and protocol engines 3 OB in Figure 
18 know when a message arrives at the daemon's port 
address having particular TIB channel data in the header 
that there are outstanding subscriptions for data on this 
5 subject. The daemon process knows the channels for which 
there are outstanding subscriptions because this 
information is sent to the communication layer by the 
service layer. The daemon 3 OB stores data received from 
the service discipline regarding all TIB channels having 

10 open subscriptions. The daemon then sends any message on 
a subject having an open subscription to processes at that 
port address which have subscribed to messages on that 
subject. The daemon does not know what the subject is but 
it does know there is a match between TIB channels having 

15 open subscriptions and subjects of some of the incoming 
messages . 

Each node coupled to the computer network of Figure 
17 such as host 300 has one network interface card and one 
port address. This port address may be assigned a 

20 "logical channel" in some networks for multiplexing of the 
network card by multiple processes running simultaneously 
on the same host. These port addresses may sometimes 
hereafter also be referred to as network addresses. How 
data gets back and forth between network addresses is the 

25 responsibility of the communication layers such as layer 
352 in Figure 17 which invokes the transport layer, 
network layer, data link layer and physical layer 
functionalities of the operating systems 370 and 372, 
network interface cards (not shown) and other circuits on 

30 the network itself such as might be found in gateway 306 
and WAN 308. 

The service layer, information layer and 
communication layer are layers which are "on top" of the 
ISO model layers, i.e. , they perform services not 

35 performed on any layer of the ISO model or they perform 
"value added" services as an adjunct to services performed 
on an ISO model layer. 
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The purpose of the service layer is, among other 
things, to provide service decoupling as that term is 
defined in the glossary herein- Service decoupling frees 
the application of the need to know the whereabouts on the 
5 network of servers providing a service and the details of 
how to communicate. To perform this function, the service 
layer includes a program or function to map the selected 
service to all servers that provide this service and pick 
one (step 412 in Fig. 19) . The service layer then maps 
10 the selected server to a protocol engine that encapsulates 
the communication procedures or protocols necessary to 
traverse the network, i.e., set up a data link regardless 
of the path that needs to be followed through the network 
to get to this server, and communicate with the selected 
15 server regardless of what type of machine it is, what type 
of network and network card it is coupled to and what 
operating system this server runs. This process is 
symbolized by step 414. 

In alternative embodiments , the application or 
20 subscribing process may call the protocol engine directly 
by having done its own subject based addressing and 
encapsulating its own communication protocol. In other 
alternative embodiments, the service layer will select all 
the servers supplying a service and request the 
25 communication layer to set up data links with all of them 
simultaneously to increase the throughput of the network 
or to use one server and switch to another upon failure of 
the selected server. 

Normally all services that provide a selected service 
30 are assumed to use the same communication protocol so a 
single service discipline can communicate with them all. 
However, if different instances of the same services or 
different services providing data on the same subjects use 
different communication protocols, the teachings of the 
35 invention contemplate subclasses of service disciplines. 
This means that the information layer will call a generic 
service discipline which contains code which can be shared 



WO 92/07324 



PCT/US91/07652 



by all subclasses of this service discipline to do common 
functions such as subscribe or cancel which are done the 
same way on all servers that provide this service. The 
generic service discipline will then map the subscription 
5 request to one or more of the different servers that 

provide the service. The service discipline (s) code which 
encapsulates the communication procedure peculiar to the 
selected server (s) is then called and runs to finish the 
process of setting up the subscription data stream with 
10 the selected server (s) . 

The output of the service layer is a request to the 
communication layer to the effect , "set up a communication 
link by whatever means you deem most appropriate with the 
following server or services on the following subject 
15 channel," The service layer also sends a pointer to the 
service layer callback routine which will handle messages 
or packets on the requested subject. This process is 
symbolized by step 416. In some embodiments the network 
addresses of all servers that run service processes 
20 supplying data on the requested subject are passed to a 
DCC library program which stores them for use in providing 
a reliable communication link by providing failure 
recovery in case the selected server crashes. 

Step 418 represents the process where the selected 
25 protocol engine sets up the requested data link by 
invoking selected functions of the transport layer 
protocols encapsulated in the operating system. These 
protocols invoke other communication protocols on the 
network, data link and physical layers so as to set up the 
30 requested data link and log onto the service as symbolized 
by step 420. The service layer service discipline usually 
then sends a message to the service notifying it of the 
subscription and the subject channel assigned to this 
subject as symbolized by step 422. The subject channel is 
35 noted by the information service and/or communication 

layer of the TIB interface software linked to the service. 
This allows the subject channel data to be added to the 



WO 92/07324 



PCT/US91/07652 



packet headers of transmitted packets on the subject of 
interest. This subscription message starts the flow of 
data in some embodiments, while in other embodiments, the 
flow of data starts when the data link to the server is 
5 first established. 

In some embodiments, a single subscription may * 
necessitate calling multiple services, so the information 
layer may map the subject to multiple service disciplines. 
These in turn map the request to multiple protocol engines 
10 which simultaneously set up data links to the multiplier 
services. 

In some alternative embodiments, the service 
disciplines talk directly to the transport layer and 
encapsulate the protocols necessary to communicate on the 
15 current network configuration. In these embodiments, the 
service layer may filter incoming messages by subject 
before calling the callback routine in the information 
layer. 

On small networks an alternate embodiment is to 

20 broadcast on the network subscription requests to 

particular subjects. Services coupled to the network 
listen to these broadcasts and send messages on the 
subjects of interest to the port addresses identified in 
the broadcasts. These messages are then directed by the 

25 DCC layer at the port address to the requesting process in 
the manner described elsewhere herein. 

In alternative embodiments, the service layer also 
performs other functions such as: regulating access to 
certain services; session management in the traditional 

30 sense of the session layer of the ISO model; replication 
management of replicated services and servers; 
failure/recovery management in case of failure of a 
service; distribution management; load balancing to 
prevent one server or service from being inequitably 

35 loaded with data requests when other services/ servers can 
fill the need; or, security functions such as providing 
secure, encoded communications with a server. Of 
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particular importance among these alternate embodiments 
are the embodiments which encapsulate service recovery 
schemes on the service layer. In these embodiments, when 
a server goes down, a recovery scheme to obtain the same 
data elsewhere encapsulated in the appropriate service 
discipline is run to re-establish a new data link to an 
alternate server as symbolized by step 424. 

In the preferred embodiment, the service discipline 
assigns the TIB channel to the subject and picks the 
protocol engine to use in terms of the characteristics of 
the server and the service to be accessed and the network 
and network protocol to be used, and in light of the 
degree of reliability necessary. 

The daemon 30B in Figures 15 and 18 can include many 
15 different protocol engines, each of which has different 
characteristics. For example there may be a protocol 
engine for point-to-point communication between nodes 
having Novell network interface cards and using the Novell 
protocol and a protocol engines for point-to-point 
20 communications between nodes using the TCP and UDP 

protocols and associated network interface cards. There 
may also be a protocol engine for communication wit high 
speed data publishers using reliable broadcast, and a 
protocol engine for either point-to-point or reliable 
broadcast communication using the Intelligent Multicast" 
protocol. There can be as many protocol engines as there 
are options for communication protocols, types of servers 
and services and reliability options as are desired, and 
more can be added at any time. 

Further , some of the service disciplines may be 
procedures for communicating with other processes on the 
same machine such as the operating system or another 
application or directly with a user through a terminal. 
More service disciplines can be added at any time to 
35 communicate with new sources of information. 

A service discipline, when it receives a subscription 
request may open a specific TIB channel for that subject 
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or allow any arbitrary TIB channel to be used. 

The selected service discipline or disciplines pick 
the protocol engine that has the right characteristics to 
efficiently communicate with the selected service by 
5 calling a DCC library program. The DCC library programs 
updates the subscription list with the new subscription 
and channel data and send a message to the selected 
protocol engine via shared memory or some other inter- 
process transfer mechanism. If the host is not 

10 multitasking, the daemon will be caused to run by an 
interrupt generated by the DCC library program. The 
message to the selected protocol engine will be as 
previously described and will include the identity of the 
selected server. The protocol engine will map the 

15 identity of this server to the network address of the 
server and carry out the communication protocol 
encapsulated within the selected protocol engine to set up 
the data link. Some of these protocols are value added 
protocols to, for example, increase the reliability of 

20 generic transport layer broadcast protocols or to do 
intelligent multicasting. These value added protocols 
will be detailed below. This step is symbolized by step 
426. 

The distributed communication layers 352 and 360, 
25 function to provide configuration decoupling. This 

eliminates the need for the requesting process to know how 
to do various communication protocols such as TCP, UDP, 
broadcast etc and to have code therein which can implement 
these protocols. The protocol engines implement various 
30 communication protocols and the DCC library implements the 
notion of TIB channels and performs routing and filtering 
by subject matter based upon the TIB channel data in the 
packet headers of incoming packets. The protocol engine 
for communicating using the UDP protocol also does message 
35 disassembly into packets on the service or transmit side 
and packet reassembly into complete messages on the 
subscribing process or receive side. This is a value 
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added service since the UDP transport protocol does not 
include these disassembly and reassembly functions. The 
TCP transport protocol includes these message disassembly 
and packet reassembly functions so the protocol engine 
5 that invokes this transport layer function need not supply 
these type value added services. 

In some embodiments of the invention, the UDP 
protocol engine adds sequence numbers and data regarding 
how many packets comprise each complete message to the 

10 packet headers. This allows the daemon or DCC library of 
the receiving process TIB communication layer to check the 
integrity of the message received to insure that all 
packets have been received. 

As data packets come in from the network, they are 

15 passed up through the DCC library, service layer and 

information layer to the subscribing process. The service 
layer in some embodiments may filter the incoming messages 
by subject matter instead of having this filtering done by 
the daemon or the DCC library as in other embodiments, in 

20 still other embodiments, the filtering by subject matter 
is done by the information layer. 

In some embodiments, the service layer also performs 
data vormatization by calling programs in the TIB FORMS 
interface 231 in Figure 15 or the TIB Forms Library 32 in 

25 Figure 15. 

In some embodiments, the subject based addressing is 
done by collecting all the information a subscribing 
process could ever want in a gigantic data base and 
organizing the data base by subject matter with updates as 

30 data changes. The service layer would then comprise 
routines to map the subject request to data base access 
protocols to extract data from the proper areas of the 
data base. The communication layer in such embodiments 
maps incoming update data to update protocols to update 

35 the appropriate date in the data base. 

The preferred embodiment implements the more powerful 
notion of allowing the data sources to be distributed. 
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This allows new servers and services to be coupled to the 
system without wrecking havoc with all the existing 
application software. The use of the information, service 
and communication layers of the TIB software according to 
5 the teachings of the invention provides a very flexible 
way of decoupling the application software from the ever 
changing network below it. 

In the preferred embodiment, the filtering by subject 
matter for point-to-point protocols is done by the TIB 

10 software on the transmit side. Note that in Figure 17, 
the servers 310, 312 and 316 are decoupled from the 
network by TIB interface software symbolized by the blocks 
marked "TIB IF". Terminal 320 is also decoupled in the 
same manner and can be a service for manual entry of data 

15 by the user. Specifically, this filtering is done by the 
information layer bound to the service which is publishing 
the data. For a service that is using the broadcast 
transport protocol, the TIB communication layer at the 
network addresses receiving the broadcast would filter out 

20 all messages except those having subject matching open 
subscriptions by comparing the TIB channel data to the 
channel data for open subscriptions listed in the 
subscription table based upon subscription data generated 
by the information layer and TIB channel data generated by 

25 the service layer. Note that where a service simply 

broadcasts data, the service discipline for accessing that 
service can be as simple as "listen for data arriving at 
the following network address and filter out the messages 
on other than the subscribed subject •" The service 

30 discipline would then format the data properly by invoking 
the proper function in the TIB Forms Library and pass the 
data through the information layer to the requesting 
process. 

The use of the communication layer allows all the 
35 network configuration parameters to be outside the 
applications and subject to revision by the system 
administrator or otherwise when the network configuration 
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changes. This insulates the application software from the 
network interface and provides a functionality similar to 
and incorporating at least all the functionality of the 
ISO Model network layer. 
5 Note also that in some embodiments, the functionality 

of the information, service and communication layers could 
also be easily implemented in hardware rather than the 
software of the preferred embodiment. The service and 
communication layers implement most of the functionality 
10 the ISO Model Network, Data Link and Physical layers plus 
more. 

In some embodiments, the distributed communication 
layer only receives a general request from the service 
layer to set up a data link and decides on its own which 

15 is the most efficient protocol to use. For example, the 
Dec may receive 5 separate subscriptions for the same 
information. The DCC may elect on its own to set up 5 
separate data links or bundle the requests, set up one 
data link and distribute the arriving message by 

20 interprocess transfers to each of the 5 requesting 

processes . In other embodiments , the DCC may act on its 
own to decide which protocol to use, but may accept things 
from the service layer such as, "I want this fast" or W I 
want this reliable", in the latter case, the 

25 communication layer may elect to send two subscriptions 
for the same information to two different services or may 
set up two different links to the same service by 
different network paths. 

In the preferred embodiment, the DCC library portion 

30 of the communication library serves the sold function of 
determining how to best get data from one network address 
to Mother. All replication management and failure 
recovery protocols are encapsulated in the service 
disciplines. 

35 Referring to Figure 20, comprised of Figures 2 OA and 

2 0B, there is shown a flow chart for the processing 
involved at the three layers of the TIB software on the 
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transmit side in creating a subscription data stream at a 
publishing process or service and sending it down through 
the TIB software and across the network to the subscribing 
process* 

5 Step 430 represents the process whereby the selected 

service receives a message from a subscribing process and 
initiates a data stream. Each service such as the Quotron 
service running on server 310 in Figure 17 and the 
Marketfeed 2000 and Telerate services running on servers 

10 312 and 316, respectively, are decoupled from the network 
by a version of the three layer architecture TIB software 
according to the teachings of the invention. This is 
symbolized by the blocks marked TIB IF in these server 
boxes which stands for TIB interface software. 

15 The TIB interface for each service decouples the 

service from any requirement to have functionality capable 
of supporting filtering or subject based addressing. 
Thus, if a service is designed to broadcast all equity 
prices on the American Stock Exchange and Over-the-counter 

20 market, but the subscription is simply for IBM equity 

prices, the service responds as it always has and need not 
have a function to filter out only IBM equity prices. The 
service discipline for this type service will be adapted 
to filter out all messages except IBM equity prices in 

25 response to such a subscription request. 

Another service like Telerate which publishes many 
different pages organized by subject matter, e.g., a page 
on Government T-Bill rates, a page on long term corporate 
bond rates etc. , will be able to accept a subscription for 

30 only a specific page and may be able to accept special 
commands to caused the service to publish only specific 
columns on a particular page. In such a case, the service 
layer bound to such a service will include a service 
discipline which receives subscription requests by subject 

35 and filters messages out from a broadcast that do not 
pertain to a subject having an open subscription. 

Step 430 also represents the process of the service 
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calling the TIB-Publish function of the information layer 
TIB-INFO library and starting the flow of data toward the 
subscribing process. The service need not have any of its 
own ability to filter by subject. The subscription 
5 request it receives is in the "native tongue" that this 
service understands because it is formatted and sequenced 
in the native tongue by the service discipline of the 
subscribing process. 

Most of the filtering by subject matter is done by 
10 the service disciplines, but where this filtering is done 
depends upon the type of service. Some services publish 
only one type of data so everything such a publisher puts 
out is of interest to the subscribing process. For 
example, assume that the service accessed is the real time 
15 clock 371 in Figure 17 which puts out only the current 
time and assume that the subject of the subscription is 
"give me the time of day", in such a case, the service 
discipline is very simple and no filtering need occur. 
Such a service discipline can be simply a protocol to 
20 determine how to communicate the data to the requesting 
process and what TIB channel to assign to it. 

The fact that the service starts sending data in 
whatever manner such a service normally sends data is 
symbolized by step 432. Thus, if the service is Telerate, 
25 it can send the page image and updates for any one of a 
number of different pages and it understands a 
subscription for only one of its many pages whereas the 
Quotron service would not understand a subscription for 
only IBM equity prices. The various service disciplines 
30 of the service layer provide, inter alia, the necessary 
functionality which the service does not have. 

step 432 assumes a service which broadcasts messages 
on many different subjects and a subscription request for 
only one or a few of those subjects. In other 
35 hypothetical examples, the service may publish only the 
requested information such as a particular telerate page. 
In the Telerate case, the subscription request may specify 
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that only a particular page and particular columns of that 
page be sent and may request the page image by a point-to- 
point communication protocol using a dedicated TIB 
channel. 

5 Step 434 represents the response processing of the 

service layer to the subscription request and the stream 
of data that results. In step 434, the service discipline 
does any necessary filtering by subject matter and assigns 
the TIB channel number. The filtering by subject matter 

10 is generally done by the service discipline on the data 
producer side of an exchange only when the producer 
produces vastly more data than it is called for by the 
subscription such as in the case of a high speed, 
broadcast producer. In such a case, the extraneous data 

15 could overwhelm the network. The TIB channel numbers are 
assigned by the service discipline in step 434 but they 
are not actually added to the headers for the packets 
until the message reaches the communication layer. In 
some alternative embodiments, the TIB channel numbers may 

20 be written to the packet headers by the service 
discipline. 

The TIB channel number assignment is done by the 
service discipline based upon the type of service, 
subscription and communication protocol being used. Where 

25 a broadcast protocol is being used, the service discipline 
in some embodiments will, in step 434, simply assign 
different TIB channel numbers to different subjects and 
send a message to subscribers listed in a subscription 
table maintained by the service layer on the information 

30 layer. The message will say simply, for example, for 
updates on IBM equity prices, monitor TIB channel 100. 
Note that the TIB channel data is used by the TIB software 
of the receiving host solely to route messages to the 
proper subscribing processes. TIB channels have nothing 

35 to do with logical channels, network routing or other 
network, data link or physical layer issues. 

In other embodiments, in the broadcast protocol 
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situation, the service discipline will consult a 
subscription list and filter out all messages on subjects 
other than subjects with open subscriptions. For those 
subjects, TIB channel will be assigned and a message will 
5 be sent to the TIB software linked to the subscribing 
processes as to what TIB channels to listen to for 
messages to be routed to their client processes., 
In the case of point-to-point protocols, the 
subscription requests usually contain the TIB channel 
10 numbers assigned to the subject by the service discipline 
selected by the information layer linked to the 
subscribing process. In such a case, step 434 represents 
the process of assigning the TIB channel number received 
in the subscription request to messages emitted from the 

15 service. In a typical case of subscription to a Telerate 
page which specifies that a particular TIB channel is to 
be used in case a point-to-point protocol is selected, the 
service discipline will send the page image by selecting a 
point-to-point protocol engine. The service discipline 

20 will also send a message acknowledging the subscription 
and advising the TIB software of the subscribing process 
to listen to a particular TIB channel for broadcasts of 
updates to the page. The receiving TIB software then 
opens a TIB broadcast channel for the updates. 

25 Step 436 represents the processes performed by the 

DCC library after the service discipline calls it. The 
DCC library's sole function in the preferred embodiment is 
to determine the best way to send a message to any 
particular network address where the service discipline or 

30 the subscription request does not specify the 

communication protocol to be used. In some embodiments, 
the DCC library of the communication layer will accept 
suggestions from the service layer or subscription request 
as to how to send the message but may select a different 

35 protocol if this is deemed to be more efficient. 

Further, the DCC library may change the communication 
protocol being used based upon changing conditions such as 
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number of subscribers. For example, an Intelligent 
Multicast protocol may be chosen (described in more detail 
below) . In this protocol > a point-to-point protocol is 
used when the number of subscribers is below a certain 

5 cutoff number (programmable by the system administrator 
but switchover to a broadcast protocol automatically 
occurs when the number of subscribers rises above the 
cutoff number. In the preferred embodiment "high-water 11 
and "low-water 11 marks are used as will be described below. 

0 In other embodiments, any cost function may be used to set 
the switchover point based upon cost and efficiency of 
sending multiple point-to-point messages as opposed to a 
single broadcast message* 

Step 436 also represents the process of retrieving 

5 the message from local memory of the service and putting 
it into an interprocess transfer process to send it to the 
protocol engine/daemon 30B in Figure 15. 

Step 438 represents the processes carried out by the 
protocol engine of the service to transmit the messages to 

0 the subscribing processes. If the transport protocol in 
use is UDP, the protocol engine, in some embodiments, will 
do a packetizing function. This is the process of 
breaking down the message into packets and adding header 
data on the transmit side and reassembling the packets in 

5 the proper order on the receiver side. The TCP transport 
protocol does its own packetizing, so protocol engines 
that invoke this transport layer need not packet ize. 
Nonpacket protocol engines also exist for other types of 
transport protocols. 

0 The protocol engine also writes the port address of 

the machine running the subscribing process in the message 
headers nd may perform other value added services. These 
other value added services include reliable broadcast and 
Intelligent Multicasting. Reliable broadcast services 

5 will be explained below, but basically this service 
provides functionality that does not exist in current 
broadcast communication protocols to increase reliability. 
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The protocol engines have a standard programmers 
interface through which they communicate with the 
transport layer routines in the operating system. The 
steps taken by the protocol engine to invoke the transport 
5 layer functionality so as to drive the network, data-link 
and physical layer protocols in such a manner so as to 
deliver the messages to the subscribing processes. This 
process is symbolized by steps 440 and 442. Exactly what 
these steps are cannot be specified here because they are 
10 highly dependent upon the structure, configuration and 
protocol of the network as well as the interface to the 
transport layer. When any of these change, the protocol 
engines may have to be changed to accommodate the change 
to the network. This, however, prevents the need to 
15 change the application software thereby providing 
configuration decoupling. 

After the message traverses the network , it is picked 
up by the network interface card having the port address 
shared by the subscribing process. This process is 
20 symbolized by step 444. The network card buffers the 

message and generates an interrupt to the transport layer 
routine which handles incoming messages. 

Step 446 represents the process where the transport 
layer software calls the appropriate protocol- engine of 
25 the daemon 3 OB in the communication layer such as layers 
352 or 360 in Figure 17. The incoming message or packet 
will be passed to the appropriate protocol engine by some 
interprocess transfer mechanism such as shared memory. In 
the preferred embodiment, the daemon is an ongoing process 
30 running in background on a multitasking machine. In other 
embodiments, the daemon is interrupt driven and only runs 
when a message has been received or is to be transmitted. 
Step 446 also represents the packet reassembly process for 
TCP or other transport layer protocols here packet 
35 reassembly is done by the transport layer. 

Step 448 represents the processes performed by the 
protocol engine in the daemon to process and route the 
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incoming message. 

For UDP transport layer protocol engines, packet 
reassembly is done. This of course implies that the 
protocol engine of the data producer process added 

5 sequence numbers to the packet headers so that they could 
be reassembled in the proper order. Other value added 
services may then be performed such as checking all the 
sequence numbers against data which indicates the sequence 
numbers which should have arrived to determine if all the 

0 packets have been received. In some embodiments, the data 
as to the sequence numbers to expect is written into 
fields dedicated to this purpose in the packet headers. 
In other embodiments, this data is sent in a separate 
message. 

5 If any packets are missing, in these embodiments a 

message will automatically be sent by the receiving 
communication layer back to the data producer process' 
communication layer to request retransmission of any lost 
or garbled packets. This of course implies that the 

0 communication layer for the data process stores all 
packets in memory and retains them for possible 
retransmission until an acknowledgment message is received 
indicating that all packets have been successfully 
received. 

5 Step 448 also symbolizes the main function performed 

by the communication layer daemon/protocol engine in 
receiving messages. That function is routing the messages 
to the appropriate subscribing process according to the 
TIB channel information in the header. The protocol 

0 engine checks the TIB channel number in the header against 
the current subscription list sent to it by the service 
discipline. The subscription list will include pointers 
to the appropriate service discipline callback routine and 
subscribing process for messages assigned to any 

5 particular TIB channel. The protocol engine also filters 
messages by TIB channel number for embodiments in which 
messages reach the subscribing process' TIB software which 
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do not pertain to the subscribed to subject. This may 
also be done at the service layer or information layer but 
it is most efficient to do it at the communication layer. 
The protocol engine will then put the message in the 
5 appropriate interprocess transfer mechanism, usually 
shared memory or a Unix™ pipe, and generate an interrupt 
to the DCC library as symbolized by step 450. This 
interrupt will vector processing to the appropriate DCC 
library callback routine which was identified to the 
10 protocol engine by the DCC library when the subscription 
on this TIB channel and subject was opened. The DCC 
library routine so invoked is linked to and part of the 
subscribing process which initiated the subscription. The 
DCC library callback routine then retrieves the message 
15 from the interprocess transfer mechanism and stores it in 
local memory of the subscribing process. The DCC library 
callback routine then generates an interrupt to the 
service layer and passes it a pointer to the message. 
Step 452 represents the process performed by the 
20 service layer on incoming messages. The interrupt from 
the DCC library causes to run the service discipline 
callback routine identified in the original subscribe 
message passed by the service layer through the DCC 
library. The callback routine will, in some embodiments, 
25 do any data format conversions necessary and may, in other 
embodiments do subject matter filtering. Then, the 
service discipline generates an interrupt to the 
information layer which cause the callback routine of the 
information layer to run. The interrupt contains a 
30 pointer to the message. 

Step 454 symbolizes processing of incoming messages 
by the information layer. In some embodiments, the 
service layer does not guarantee that all messages 
reaching the information layer exactly match the subject 
35 for which data was requested. In these embodiments, step 
454 symbolizes the process of comparing the TIB channel 
code to the subject of the subscription to make sure they 
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match. 

Step 456 symbolizes the process of generating an 
interrupt to the callback routine of the subscribing 
process if there is a match on subject. If not, no 
5 interrupt is generated and monitoring for new messages 
continues by the daemon while all the interrupt driven 
processes terminate and release their computer resources 
until the next interrupt. 

Step 458 symbolizes the process of use of the message 

10 data for whatever purpose the subscribing process 
originally sought this data. 

Reliable broadcast is one of the value added services 
that the communication layer can use to supplement and 
improve the communication protocols of the transport 

15 layer. Traditional broadcast protocols offered by prior 
art transport layers are not reliable. For example , if 
there is noise on the line which corrupts or destroys a 
packet or message or if the network interface card 
overflows the buffer, packets or entire messages can be 

20 lost and the processes listening for the message never 
gets the message, or they get an incomplete or garbled 
message. There is no acknowledge function in traditional 
broadcast, so if some of the processes miss the message or 
get incomplete or garbled messages, the transmitting 

25 process never finds out. This can happen for one in every 
hundred packets or for one in ten packet Traditional prior 
art transport layer broadcast protocols do not include 
functionality, i.e., program code, to distribute a 
broadcast message received at the network address of the 

30 host to multiple processes running on that host. 

The communication layer according to the teachings of 
the invention includes at least one protocol engine to 
implement reliable broadcast protocols which are built on 
top and supplement the functionality of the prior art 

35 transport layer broadcast protocols. Referring to Figure 
21, there is shown a flow chart for one embodiment of a 
reliable broadcast protocol implemented by the 
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communication layer, step 500 represents the process 
where the DCC library receives a request from a service 
discipline to send a message having a particular TIB 
channel assigned thereto, in some embodiments, this 
5 request may also include a request or a command to send 
the message by the reliable broadcast protocol. In 
embodiments where the reliable broadcast protocol is 
mandated by the service discipline, the service discipline 
includes a function to determine the number of subscribers 
10 to a particular channel and determine the cost of sending 
the same message many times to all the port addresses of 
all subscribers versus the cost of sending the message 
once by broadcast with messages to all subscribers to 
listen to TIB channel XX (whatever TIB channel number was 
15 assigned to this subject) for data on the subjects they 
are interested in. In the embodiment illustrated in 
Figure 21, this cost determination function is included 
within the communication layer DCC library functionality. 
Step 502 represents this cost determination process 
20 as performed by the DCC library. The particular program 
of the DCC library which implements this function, checks 
the subscription list and counts the number of subscribers 
to the TIB channel assigned to this message. The cost of 
sending the message point-to-point to all these 
25 subscribers is then evaluated using any desired costing 
function. In some embodiments, the cost function may be a 
comparison of the number of subscribers to a predetermined 
cutoff number. The particular cost f unction used is not 
critical to the invention. The cost of sending the 
30 message to multiple subscribers point-to-point is that the 
same message must be placed repeatedly on the network by 
the data producing software. The cost of broadcasting a 
message is that all network cards pick it up and may 
interrupt the transport protocol program in the operating 
35 system of the host which transmits the message by 

interprocess transfer to the TIB daemon only to find out 
the message is not of interest to any client process 
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running on that host. Computer resources are thus wasted 
at any such host. 

Step 504 represents the process the DCC library 
carries out to evaluate the cost and decide to send the 
5 message either by point-to-point protocol or reliable 
broadcast. If it is determined that the number of 
subscribers to this TIB channel is small enough, the 
decision will be made to send the message by a 
point-to-point protocol. Step 506 represents the process 

10 of calling the point-to-point protocol engine and sending 
the message using this protocol. 

If the number of subscribers is too high for 
efficient point-to-point transmission, the DCC library 
calls the reliable broadcast protocol engine as symbolized 

15 by step 508. 

Step 510 represents the first step of the reliable 
broadcast protocol processing. The reliable broadcast 
protocol according to the teachings of the invention 
supports multiple subscribing processes running of the 

20 same host and requires that each subscribing process 

receive all the packets of the message without error and 
acknowledge receipt thereof. The insure that this is the 
case, sequence numbers must be added to the headers of 
each packet and some data must be communicated to the 

25 subscribing processes that indicate the sequence numbers 
that must all have been received in order to have received 
the entire message. In some embodiments, only the 
sequence numbers will be added to the packet headers and 
the data regarding the sequence numbers that comprise the 

30 entire message will be sent by separate message to each 
process having an open subscription to the TIB channel 
assigned to the message. In other embodiments, the 
sequence numbers that comprise the entire message will be 
added to the header of the first packet or to the headers 

35 of all the packets. The sequence numbers added to the 
packets are different than the sequence numbers added by 
packetizing functionality of the transport protocols of 
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the operating system in TCP protocols since the TIB 
sequence numbers are used only to determine is all packets 
of a message have been received. In some embodiments, the 
packet sequence numbers added by the transport protocol 
5 may be used by the TIB communication layer of the 

subscribing processes to determine if all the packets have 
been received. In other embodiments of reliable broadcast 
protocol engines for supplementing the UDP transport layer 
protocol, the packet izing function of the protocol engine 

10 adds sequence numbers which can be used both for 

transport/network/data link/physical layer functions but 
also for TIB communication layer functions in verifying 
that all packets of a message have been received. 

After the sequence numbers have been added, the 

15 packets are written to a retransmit buffer with their 
sequence numbers for storage in case some or all of the 
packets need to be retransmitted later as symbolized by 
step 512. 

Before the messages can be sent to the various 

20 subscribing processes, the reliable broadcast protocol 
engine adds the TIB channel data to the header of each 
packet and sends a message to each subscribing process 
listed in the subscription table as having open 
subscriptions for this channel to listen for data on their 

25 requested subject on TIB channel XX where XX is the TIB 
channel number assigned to this subject. 

Step 516 represents the process of transmitting the 
packets via the standard broadcast protocols of the 
transport layer by calling the appropriate operating 

30 system program and passing a pointer to each packet. 

Referring to Figure 22, there is shown a flow chart 
of the processing by the communication layer of the 
subscribing process in the reliable broadcast protocol. 
The packets broadcast on the network are picked up by all 

35 network interface cards of all hosts on the network which 
then invoke the transport protocol software of the 
operating systems of the various hosts. The transport 
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protocols then notify the daemons of the communication 
layers that a broadcast message has arrived and puts the 
packets in an interprocess transfer mechanism, usually 
shared memory. The daemons then retrieve the packets from 
5 the interprocess transfer mechanism as represented by step 
518. 

Step 520 represents the process of checking the TIB 
channel numbers of the incoming packets to determine if 
they correspond to the TIB channel of any open 

10 subscription. If they do, the reliability sequence 
numbers are checked by the reliable broadcast protocol 
engine against the data indicating which packets and 
corresponding sequence numbers should have been received 
to have received a complete message. In some embodiments, 

15 especially embodiments using transport, network, data link 
and physical layers where error checking (ECC) is not 
performed at layers below the TIB interface software of 
the invention, error detection and correction is performed 
on the packets using the ECC bits appended to the packet. 

20 If errors have occurred that are beyond the range of 
correction given the number of ECC bits present, the 
packet is marked as garbled. 

After determining which packets are missing or 
garbled, if any, the receiving protocol engine then sends 

25 a message back to the communication layer of the service 
or publishing process. This message will either 
acknowledge that all packets have been received without a 
problem or will request that certain packets be 
retransmitted. This is symbolized by step 522. 

30 Step 524 represents the process of retransmission of 

the missing or garbled packets by the communication layer 
of the data producing process or service* In some 
embodiments, the missing or garbled packets will be sent 
point-to-point to only the subscribing that did get them. 

35 In other embodiments, the missing or garbled packets are 
broadcast to nodes with notification messages being sent 
to the subscribing processes that need them to listen on 
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10 



15 



TIB channel XX where XX is the TIB channel on which the 
packets will be broadcast. The phrase "listen to channel 
XX" as it is used here has nothing to do with the actual 
transmission frequency, timeslot or other physical 
characteristic of the transmission, it merely means that 
the missing or garbled packets will be appearing on the 
network shortly and will have TIB channel XX routing 
information in their header data. 

Step 526 represents the process of checking by the 
receiving communication layer that the replacement packets 
have been properly received similar to the processing of 
step 520. If they have, the receiving communication layer 
acknowledges this fact to the communication layer of the 
service, if not, a request for retransmission of the 
missing or garbled packets is again sent to the 
communication layer of the transmitting process, 
retransmission ensues and the whole process repeats until 
all packets have been successfully received. The final 
acknowledge message from the receiving communication layer 
to the transmitting communication layer that all packets 
have been successfully received causes the reliable 
broadcast protocol engine of the transmitting 
communication layer to flush all the packets from the 
retransmission memory as symbolized by step 528. 

Step 530 represents the routing process where the 
reliable broadcast protocol engine checks the TIB channel 
data against the subscription list to determine which 
client processes have requested data assigned to this TIB 
channel. Once this information is known, the protocol 
30 engine passes a pointer to the message to all service 
disciplines which have entered subscriptions for data on 
this TIB channel. In some embodiments, the protocol 
engine with place a copy of the message in a separate 
interprocess transfer mechanism for every subscribing 
35 process. In other embodiments, shared memory will be the 
interprocess transfer mechanism and a pointer to the same 
copy of the message will be sent to all subscribing 
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processes. The subscribing processes will then arbitrate 
for access to the message in the information layer or the 
service layer. 

Step 532 represents the processes previously 
5 described of passing the message up through the service 
and information layers to the subscribing process by 
successive interrupts causing to run the callback routines 
designated when the subscription was entered. Filtering 
by subject matter may also occur in some embodiments at 

10 the service layer and/ or the information layer to 
guarantee a match to the subscribed subject. 

Figure 23 is a flow chart of processing to transmit 
data by the Intelligent Multicast communication protocol. 
This protocol uses either point-to-point or reliable 

15 broadcast protocols for each message depending upon the 

subject matter and how many subscriptions are open on this 
subject. The choice of protocol is automatically made for 
eacn message depending upon how many subscribing 
processes/network addresses there are for the message at 

20 the time the message is published. If the number of 
subscribers for a subject changes sufficiently, the 
transmission protocol may change automatically. 

Step 600 represents the process of receiving a 
subscription request at the service layer of the data 

25 publishing process, passing this subscription along to the 
subscribing process and making an entry for a new 
subscription in the subscription table. 

In step 602, a message is published by the service 
through the information layer to the service layer. The 

30 subject of the message may or may not be on the subject 
for the new subscription was just entered. The service 
layer examines the subject data forwarded by the 
information layer about the message and coordinates with 
the information layer to assign a TIB channel to the 

35 subject if the TIB channel was already assigned by the 
service and information layers of the subscribing process 
as symbolized by step 604. 
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In step 606, the service discipline compares the 
number of subscribers for the subject of the message to a 
programmable cutoff number which is based upon the cost of 
transmission point-to-point versus the cost of 
5 transmission by reliable broadcast. The programmable 
cutoff number can be set and altered by the system 
administrator and is based upon any desired cost function, 
the nature of which is not critical to the invention. In 
the preferred embodiment, the cost function is comprised 

10 of a high water mark and a low water mark. If the number 
of subscribers is above the high water mark, the message 
will be sent by reliable broadcast. If the number of 
subscribers to this subject then subsequently falls below 
the low water mark, subsequent messages will be sent 

15 point-to-point. In some embodiments, the cost function 
can be an automatic learning program that listens to the 
network and subscription requests and makes the decision 
based upon latency time or some other criteria of network 
efficiency. 

20 step 608 represents the process of calling the 

reliable broadcast protocol engine if the number of 
subscribers is greater than the cutoff number. The 
message is then put in an interprocess transfer mechanism 
directed to this protocol engine. 

25 if the number of subscribers is below the cutoff 

number, point-to-point transmission is more efficient so 
the service discipline calls the point-to-point protocol 
engine and puts the message into an interprocess transfer 
mechanism directed to this protocol engine as symbolized 

30 by step 610. 

Step 612 represents the process of waiting for the 
next message or subscription and returning to step 600 if 
a subscription is received and to step 602 if another 
message is received. 

35 in summary the concept of the invention is to use 

software layers to decouple applications from the 
complexities of the computer network communication art in 
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ways that applications have never before been decoupled. 
For example, it is believed that the subject based 
addressing decoupling provided by the information layer is 
new especially when coupled with the service decoupling 
5 provided by the service layer. It is believed to be new 
to have extensible service and communication layers that 
can be easily modified by the addition of new service 
disciplines and protocol engines to provide service and 
configuration decoupling under changing conditions such as 

10 new network topologies and the addition of new or changed 
services and/ or servers to the network. It is new to have 
a service layer that includes many different service 
disciplines designed to encapsulate many varied 
communication protocols. For example, these service 

15 disciplines can handle communication with everything from 
services like Telerate to operating system programs, other 
processes on other machines (or even the same machine or 
another part of the same process even) to a user sitting 
at a terminal. Further, the abilities of this service 

20 layer to implement failure monitoring and recovery, 
distribution and replication management, and 
security /access control services is new. 

Further, it is new to have the configuration 
decoupling and value added services of the communication 

25 layer. 

The teachings of the invention contemplate use of any 
one of these layers or any combination of the three in the 
various embodiments which together define a class or genus 
of software programs the species of which implement the 

30 specific functions or combinations thereof defined herein. 
Attached hereto as Appendix A is a complete source 
code listing for the TIB™ communication interface software 
according to the teachings of the invention in the C 
programming language. Included are all library programs, 

35 all interfaces and all layers of the software for both 
data publishing and data consuming processes. Also 
included are all utility programs necessary to compile 



this software into machine readable code for a Unix™ based 
multitasking workstation. 

There follows a more detailed specification of the 
various library programs and the overall structure and 
functioning of an embodiment of the communication inter- 
face according to the teachings of the invention. 

Information Driven Architecture™, Teknekron 
Information Bus™, TIB™, TIBINFO™, TIBFORMS™, Subject-Based 
Addressing™, and RMDP™ are trademarks of Teknekron 
Software Systems, Inc. 

CONTENTS 
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5. Data-exchange Component :TIBFORM 

6. Appendix: 'man' Pages 
1. Introduction 

The Teknekron Information Bus™ software (TIB™ 
component) is a distributed software component designed to 
facilitate the exchange of data among applications 
executing in a real-time, distributed environment. It is 
built on top of industry standard communication protocols 
(TCP/IP) and data-exchange standards (e.g., X.400). 

The document is organized as follows. Section 2 
gives an architectural overview of the TIB™. Section 3 
describes the Reliable Market Data Protocol. This general 
purpose protocol is particularly well suited to the 
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requirements of the page-based market data services. It 
is also often used for bulletin and report distribution. 
Section 4 describes TIBINF0, an interface supporting 
Subject-based Addressing. Section 5 describes a component 
5 and its interface that supports a very flexible and 
extensible data-exchange standard. This component is 
called TIBFORMS. The Appendix contains (UNIX-like) manual 
pages for the core interfaces. 

2. Architectural Overview 

10 2.1 introduction 

The Teknekron Information Bus (TIB™) is comprised of 
two major components: the (application-oriented) data com- 
munication component and the data -exchange component. 
These are depicted in Figure 2.1. In addition, a set of 

15 presentation tools and a set of support utilities have 
been built around these components to assist the applica- 
tion developer in the writing of TIB^-based applications. 

The (application-oriented) data communication 
component implements an extensible framework for 

20 implementing high-level, communication protocol suites. 
Two protocol suites have been implemented that are 
tailored toward the needs of fault- tolerant, real-time 
applications that communicate via messages. Specifically, 
the suites implement subscription services that provide 

25 communication support for monitoring dynamically changing 
values over a network. Subscription services implement a 
communication paradigm well suited to distributing market 
data from, for example, Quotron or Telerate. 

One of the protocol suites supports a traditional 

30 service-oriented cooperative processing model. The other 
protocol suite directly supports a novel 
information-oriented, cooperative processing model by 
implementing subject-based addressing. Using this 
addressing scheme, applications can request information by 
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subject through a general purpose interface. 
Subject-based addressing allowing information consumers to 
be decoupled from information producers; thereby, 
increasing the modularity and extensibility of the system. 
5 The application-oriented protocol suites are built on 

top of a common set of communication facilities called the 
distributed communications component, depicted as a 
sublayer in Figure 2.1. In addition to providing reli- 
able communications protocols, this layer provides loca- 
10 tion transparency and network independence to its clients. 

The layer is built on top of standard transport-layer 
protocols (e.g., TCP/IP) and is capable of supporting 
multiple transport protocols. The data-exchange component 
implements a powerful way of representing and transmitting 
15 data. All data is encapsulated within self -describing 
data objects, called TIB™-forms or, more commonly, simply 
forms, since TIB™ forms are self -describing, they admit 
the implementation of generic tools for data manipulation 
and display. Such tools include communication tools for 
20 sending forms between processes in a machine-independent 
format. Since a self -describing form can be extended 
without adversely impacting the applications using it, 
forms greatly facilitate modular application development. 
The two major components of TIB™ were designed so 
25 that applications programmers can use them independently 
or together. For example, forms are not only useful for 
communicating applications that share data, but also for 
non-communicating applications that desire to use the 
generic tools and modular programming techniques supported 
30 by forms. Such applications, of course, do not need the 
communication services of the TIB™. Similarly, 
applications using subject-based addressing, for example, 
need not transmit forms, but instead can transmit any data 
structure. Note that the implementation of the communica- 
35 tion component does use forms, but it does not require 
applications to use them. 
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2.2 System Model 

The system model supported by the TIB™ consists of 
users, user groups, networks, services, service instances 
(or servers), and subjects, 
5 The concept of a user, representing a human 

"end-user," is common to most systems. A user is 
identified by a user-id. The TIB 1 * user-id is normally the 
same as the user- id (or logon id) supported by the 
underlying operating system, but it need not be. 

10 Each user is a member of a exactly one group. The 

intention is that group should be composed of users with 
similar service access patterns and access rights. Access 
rights to a service or system object are grantable at the 
level of users and at the level of groups. The system 

15 administrator is responsible for assigning users to 
groups. 

A network is a logical concept defined by the 
underlying transport layer and is supported by the TIB™. 
An application can send or receive across any of the 
20 networks that its host machine is attached to. It also 
supports all gateways functions and internetwork routing 
that is supported by the underlying transport-layer 
protocols . 

Since the lowest layer of the TIB™ communication 
25 component supports multiple networks, application-oriented 
protocols can be written that transparently switchover 
from one network to another in the event of a network 
failure. 

A service represents a meaningful set of functions 
30 that are exported by an application for use by its 
clients. Examples of services are an historical news 
retrieval service, a Quotron datafeed, and a trade ticket 
router. An application will typically export only one 
service, although it can export many different services. 
35 A service instance is an application process capable 

of providing the given service. (Sometimes these are 
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called "server processes.") For a given service, several 
instances may be concurrently providing it, so as to 
improve performance or to provide fault tolerance. 
Application-oriented communication protocols in the TIB™ 
5 can implement the notion of a "fault-tolerant" service by 
providing automatic switchover from a failed service 
instance to an operational one providing the same service. 

Networks, services, and servers are traditional 
components of a system model and are implemented in one 

10 fashion or another in most distributed systems. On the 
other hand, the notion of a subject is novel to the 
information model implemented by the TIB 1 *. 

The subject space consists of a hierarchical set of 
subject categories. The current release of the TIB™ sup- 

15 ports a 4 level hierarchy, as illustrated by the following 
wel 1 formed sub j ect : " equity . ibm . compos it e . trade . w The 
TIB™ itself enforces no policy as to the interpretation of 
the various subject categories. Instead, the applications 
have the freedom and responsibility to establish conven- 

20 tions on use and interpretation of subject categories. 

Each subject is typically associated with one or more 
services producing data about that subject. The 
subject-based protocol suites of the TIB™ are responsible 
for translating an application's request for data on a 

25 subject into communication connections to one or more 
service instances providing information on that subject. 

A set of subject categories is referred to as a 
subject domain. The TIB™ provides support for multiple 
subject domains. This facility is useful, for example, 

30 when migrating from one domain to another domain. Each 
domain can define domain-specific subject encoding 
functions for efficiently representing subjects in message 
headers . 

2.3 Process Architecture 

35 The communication component of the TIB™ is a truly 
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distributed system with its functions being split between 
a frontend TIB™/ communication library, which is linked 
with each application, and a backend TIB 1 */ communication 
daemon process, for which there is typically one per host 
5 processor. This process architecture is depicted Figure 
2.2. Note that this functional split between TIB™ library 
and TIB™ daemon is completely transparent to the 
application, in fact, the application is completely 
unaware of the existence of the TIB™ daemon, with the 
10 exception of certain failure return codes. 

The TIB™ daemons cooperate among themselves to ensure 
reliable, efficient communication between machines. For 
subject-addressed data, they assist in its efficient 
transmission by providing low-level system support for 
15 filtering messages by subject. 

The TIB™/communication library performs numerous 
functions associated with each of the application-oriented 
communication suites. For example, the library translates 
subjects into efficient message headers that are more 
20 compact and easier to check than ASCII subject values. It 
also maps service requests into requests targeted for 
particular service instances, and monitors the status of 
those instances. 

The data-exchange component of TIB™ is implemented as 
25 a library, called the TIB™/form library, that is linked 
with the application. This library provides all of the 
core functions of the data-exchange component and can be 
linked independently of the TIB™/communication library. 
The TIB™/form library does not require the 
30 TIB™/communication daemon. 

2.4 Communication Component 



The TIB™ Communication Component consists of 3 
subcomponents: the lower-level distributed communication 
component (DCC) , and two high-level application-oriented 
communication protocol suites-the Market Data Subscripts 
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Service (MDSS) , and the Subject-Addressed Subscription 
Service (SASS). 

The high-level protocol suites are tailored around a 
communication paradigm known as a subscription. In this 
5 paradigm, a data consumer "subscribes" to a service or 
subject, and in return receives a continuous stream of 
data about the service or subject until the consumer 
explicitly terminates the subscription (or a failure oc- 
curs) . A subscription paradigm is well suited for 

10 realtime applications that monitor dynamically changing 
values, such as a stock's price. In contrast, the more 
traditional request/ reply communication paradigm is 
ill-suited for such realtime applications, since it 
requires data consumers to "poll" data providers to learn 

15 of changes. 

The principal difference between the two high-level 
protocols is that the MDSS is service-oriented and SASS is 
subject-oriented. Hence, for example, MDSS supports the 
sending of operations and messages to services, in addi- 

20 tion to supporting subscriptions; whereas, SASS supports 
no similar functionality. 

2.4.1 Market Data Subscription Service 

2.4.1.1 overview 

MDSS allows data consumers to receive a continuous 
25 stream of data, tolerant of failures of individual data 
sources. This protocol suite provides mechanisms for 
administering load balancing and entitlement policies. 

Two properties distinguish the MDSS protocols from 
the typical client/ server protocols (e.g. RPC) . First, 
30 subscriptions are explicitly supported, whereby changes to 
requested values are automatically propagated to clients. 
Second, clients request (or subscribe) to a service, as 
opposed to a server, and it is the responsibility of the 
MDSS component to forward the client's request to an 
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available server. The MDSS is then responsible for 
monitoring the server connection and reestablishing if it 
fails, using a different server, if necessary . 

The MDSS has been designed to meet the following 
5 important objectives: 

(1) Fault tolerance. By supporting automatic 
switchover between redundant services, by explicitly 
supporting dual (or triple) networks, and by utilizing the 
fault-tolerant transmission protocols implemented in the 

10 DCC (such as the "reliable broadcast protocols"), the MDSS 
ensures the integrity of a subscription against all single 
point failures. An inopportune failure may temporarily 
disrupt a subscription, but the MDSS is designed to detect 
failures in a timely fashion and to quickly search for an 

15 alternative communication path and/ or server. Recovery is 
automatic as well. 

(2) Load balancing. The MDSS attempts to balance 
the load across all operational servers for a service* It 
also rebalances the load when a server fails or recovers. 

20 In addition, the MDSS supports server assignment policies 
that attempts to optimize the utilization of scarce 
resources such as "slots" in a page cache or bandwidth 
across an external communication line. 

(3) Network efficiency. The MDSS supports the 
25 intelligent multicast protocol implemented in the DCC. 

This protocol attempts to optimize the limited resources 
of both network bandwidth and processor I/O bandwidth by 
providing automatic, dynamic switchover from 
point-to-point communication protocols to broadcast 
30 protocols. For example, the protocol may provide 
point-to-point distribution of Telerate page 8 to the 
first five subscribers and then switch all subscribers to 
broadcast distribution when the sixth subscriber appears. 

(4) High-level communication interface. The MDSS 
35 implements a simple, easy-to-use application development 

interface that mask most of the complexities of 
programming a distributed system, including locating 
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servers, establishing communication connections, reacting 
to failures and recoveries, and load balancing. 

2.4.1.2 Functionality 

The MDSS supports the following core functions: 

5 get MDSS establishes a fault-tolerant connection to a 
server for the specified service and "gets" (i.e., 
retrieves) the current value of the specified page or 
data element. The connection is subscription based 
so that updates to the specified page are 
10 automatically forwarded. 

halt "halt" the subscription to the specified service. 

derive sends a modifier to the server that could 
potentially change the subscription. 

The MDSS protocol has been high-optimized to support 
page-oriented market data feed, and this focus has been 
reflected in the choice of function names. However, the 
protocol suite itself is quite general and supports the 
distribution of any type of data. Consequently, the 
protocol suite is useful and is being used in other 
contexts (e.g., data distribution in an electronic 
billboard) . 

2.4.2 Subject-Addressed Subscription Service (SASS) 
2.4.2.1 Overview 

The SASS is a sophisticated protocol suite providing 
25 application developers a very high-level communications 
interface that fully supports the information-oriented, 
cooperative processing model. This is achieved through 
the use of subject-based addressing. 
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The basic idea behind subject-based addressing and 
the SASS's implementation of it is straightforward. 
Whenever an application requires a piece of data, 
especially, data that represents a dynamically changing 
5 value (e.g. a stock price), the application simply 
subscribes to that data by specifying the appropriate 
subject. For example, in order to receive all trade 
tickets on IBM, an application may issue the following 
subscription: "trade_ticket.IBM". Once an application has 

10 subscribed to a particular subject, it is the 

responsibility of the SASS to choose one or more service 
instances providing information on that subject. The SASS 
then makes the appropriate communications connections and 
(optionally) notifies the service instances providing the 

15 information. 

The SASS has been designed to meet several important 
objectives: 

(1) Decoupling information consumers from information 
providers. Through the use of subject-based addressing, 

20 information consumers can request information in a way 
that is independent of the application producing the 
information. Hence, the producing application can be 
modified or supplanted by a new application providing the 
same information without affecting the consumers of the 

25 information. 

(2) Efficiency. Support for filtering messages by 
subject is built into the low levels of the TIB™ daemon, 
where it can be very efficient. Also, the SASS supports 
filtering data at the producer side: data that is not cur- 

30 rently of interest to any application can simply be 
discarded prior to placing in on the network; thereby, 
conserving network bandwidth and processor I/O bandwidth. 

(3) High-level communication interface. The SASS 
interface greatly reduces the complexities of programming 

35 a distributed application in three ways. First, the 

consumer requests information by subject, as opposed to by 
server or service. Specifying information at this level 
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is easier and nor natural than at the service level. 
Also, it insulates the program from changes in service 
providers (e.g., a switch from IDN to Ticker 3 for equity 
prices) . Second, the SASS presents all data through a 
5 simple uniform interface-a programmer needing information 
supplied by three services need not learn three 
service-specific protocols, as he would in a traditional 
processing model. Third, the SASS automates many of the 
hard or error-prone tasks, such as searching for an 
10 appropriate service instance, and establishing the correct 
communication connection. 

2.4.2.2 Functionality 

For a data consumer, the SASS provides three basic 
functions: 

15 subscribe where the consumer requests information on a 
real-time basis on one or more subjects. The 
SASS components sets up any necessary 
communication connections to ensure that all 
data matching the given subject (s) will be 
20 delivered to the consumer. The consumer can 

specify that data be delivered either 
asynchronously (interrupt-driven) or 
synchronously. A subscription may result in the 
producer service instance being informed of the 
25 subscription. This occurs whenever the producer 

has set up a registration procedure for its 
service. This notification of the producer via 
any specified registration procedure is 
transparent to the consumer. 
30 cancel which is the opposite of subscribe. The SASS 
component gracefully closes down any dedicated 
communication channels, and notifies the 
producer if an appropriate registration 
procedure exists for the service. 
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receive receive and "callbacks" are two different ways 
for applications to receive messages matching 
their subscriptions. Callbacks are asynchronous 
and support the event driven programming style-a 
5 style that is particularly well-suited for 

applications requiring realtime data exchange. 
"Receive" supports a traditional synchronous 
interface for message receipt. 

For a data producer, the SASS provides a complementary set 
10 of functions. 

Note that an application can be both a producer and a 
consumer with respect to the SASS, and this is not un- 
common • 

2.4.3 Distributed Communication Component 

15 2.4.3.1 Overview 

The Distributed Communication Component (DCC) 
provides communication services to higher-level TIB 1 * 
protocols, in particular, it provide several types of 
fault transparent protocols. 

20 The DCC is based on several important objectives: 

(1) The provision of a simple, stable, and uniform 
communication model. This objective offers several 
benefits. First, it offers increased programmer 
productivity by shielding developers from the complexities 

25 of a distributed environment; locating a target process, 
establishing communications with it, and determining when 
something has gone awry are all tasks best done by a 
capable communications infrastructure, not by the program- 
mer. Second, it reduces development time, not only by 

30 increasing programmer productivity, but also by simplify- 
ing the integration of new features. Finally, it enhances 
configurability by keeping applications unaware of the 
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physical distribution of other components. This prevents 
developers from building in dependencies based on a 
particular physical configuration. (Such dependencies 
would complicate subsequent reconfigurations.) 
5 (2) Portability through encapsulation of important 

system structures. This objective achieves importance 
when migration to a new hardware or software environment 
becomes necessary. The effort expended in shielding ap- 
plications from the specific underlying communication 

10 protocols and access methods pays off handsomely at that 
time. By isolating the required changes in a small por- 
tion of the system (in this case, the DCC) , applications 
can be ported virtually unchanged, and the firm's applica- 
tion investment is protected. 

15 (3) Efficiency. This is particular important in 

this component. To achieve this, the DCC builds on top of 
less costly "connectionless 11 transport protocols in 
standard protocol suites (e.g., TCP/IP and OSI) . Also, 
the DCC has been carefully designed to avoid the most 

20 costly problem in protocols: the proliferation of data 
"copy" operations. 

rhe DCC achieves these objectives by implementing a 
layer of services on top of the basic services provided by 
vendor-supplied software. Rather than re-inventing basic 

25 functions like reliable data transfer or flow-control 

mechanisms, the DCC concentrates on shielding applications 
from the idiosyncrasies of any one particular operating 
system. Examples include the hardware-oriented interfaces 
of the MS-DOS environment, or the per-process file 

30 descriptor limit of UNIX. By providing a single, unified 
communication tool that can be easily replicated in many 
hardware or software environment, the DCC fulfills the 
above objectives. 

2.4.3.2 Functionality 

35 The DCC implements several different transmission 
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protocols to support the various interaction paradigms, 
fault-tolerance requirements, and performance requirements 
imposed by the high-level protocols. Two of the more 
interesting protocols are reliable broadcast and intel- 
5 ligent multicast protocols. 

Standard broadcast protocols are not reliable and are 
unable to detect lost messages. The DCC reliable 
broadcast protocols ensure that all operational hosts 
either receive each broadcast message or detects the loss 

10 of the message. Unlike many so-called reliable broadcast 
protocols, lost messages are retransmitted on a limited, 
periodic basis. 

The intelligent multicast protocol provides a 
reliable datastream to multiple destinations. The novel 

15 aspect of the protocol is that it can dynamically switch 
from point-to point transmission to broadcast transmission 
in order to optimize the network and processor load. The 
switch from point-to-point to broadcast (and vice versa) 
is transparent to higher-level protocols. This protocol 

20 admits the support of a much larger number of consumers 
than would be possible using either point-to-point or 
broadcast alone. The protocol is built on top of other 
protocols within the DCC 

Currently, all DCC protocols exchange data only in 

25 discrete units, i.e., "messages 91 (in contrast to many 
Transport protocols) . The DCC guarantees that the mes- 
sages originating from a single process are received in 
the order sent. 

The DCC contains fault-tolerant message transmission 

30 protocols that support retransmission in the event of a 
lost message. The package guarantees "at-most-once" 
semantics with regards to message delivery and makes a 
best attempt to ensure "exactly once" semantics. 

The DCC contains no exposed interfaces for use by 

35 application developers. 

3. RELIABLE MARKET DATA PROTOCOL 
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3 . l introduction 

The Reliable Market Data Protocol (RMDP) defines a 
programmatic interface to the protocol suite and services 
comprising the Market Data Subscription Service (MDSS) 
5 TIB™ subcomponent. RMDP allows market data consumers to 
receive a continuous stream of data, based on a subscrip- 
tion request to a given service. RMDP tolerates failures 
of individual servers, by providing facilities to 
automatically reconnect to alternative servers providing 
10 the same service. All the mechanisms for detecting server 
failure and recovery, and for hunting for available serv- 
ers are implemented in the RMDP library. Consequently, 
application programs can be written in a simple and naive 
way. 

15 The protocol provides mechanisms for administering 

load balancing and entitlement policies. For example, 
consider a trading room with three Telerate lines. To 
maximize utilization of the available bandwidth of those 
Telerate lines, the system administrator can "assign" 

20 certain commonly used pages to particular servers, i.e., 
page 5 to server A, page 405 to server B, etc. Each user 
(or user group) would be assigned a "default 19 server for 
pages which are not explicitly preassigned. (These 
assignments are recorded in the TIB™ Services Directory.) 

25 To accommodate failures, pages or users are actually 

assigned to prioritized list of servers. When a server 
experiences a hardware or software failure, RMDP hunts for 
and connects to the next server on the list. When a 
server recovers, it announces its presence to all RMDP 

30 clients, and RMDP reconnects the server's original clients 
to it. (Automatic reconnect ion avoids situations where 
some servers are overloaded while others are idle.) 
Except for status messages, failure and recovery 
reconnections are transparent to the application. 

35 The MDSS protocol suite, including RMDP, is built on 

top of the DCC and utilizes the reliable communication 
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protocols implemented in that component. In particular, 
the MDSS suite utilizes the reliable broadcast protocols 
and the intelligent multicast protocol provided therein. 
RMDP supports both LANs and vide area networks (WANs) • 
5 RMDP also supports dual (or multiple) networks in a 
transparent fashion. 

RMDP is a 11 service-addressed" protocol; a 
complementary protocol, TIBINFO, supports "subject-based 
addressing. n 

10 3.2 Programmatic Interface 

RMDP programs are event-driven. All RMDP function 
calls are non-blocking: even if the call results in 
communication with a server, the call returns immediately. 
Server responses, as well as error messages, are returned 
15 at a later time through an application-supplied callback 
procedure. 

The principal object abstraction implemented in RMDP 
is that of an Rstream, a "reliable stream, " of data that 
is associated with a particular subscription to a 

20 specified service. Although, due to failures and re- 
coveries, different servers may provide the subscription 
data at different times, the Rstream implements the 
abstraction of a single unified data stream. Except for 
short periods during failure or recovery reconnect ion , an 

25 Rstream is connected to exactly one server for the 
specified service. An application may open as many 
Rstreams as needed, subject only to available memory. 

An Rstream is bidirectional*-- in particular, the RMDP 
client can send control commands and messages to the con- 

30 nected server over the Rstream. These commands and mes- 
sages may spur responses or error messages from the 
server, and in one case, a command causes a "derived" 
subscription to be generated. Regardless of cause, all 
data and error messages (whether remotely or locally 

35 generated) are delivered to the client via the appropriate 
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Rstream. 

The RMDP interface is a narrow interface consisting 
of just six functions, which are described below. 



void 

rmdp_setPr op (property, value) 
rmdp_prop_t property; 
caddr_t value; 

Used to set the values of RMDP properties. These 
calls must be made before the call to rmdp_Init(). 
Required properties are marked with ? in the list below. 
Other properties are optional. The properties currently 
used are: 

*RMDP_CALLBACK 

Pointer to the callback function. See the 
15 description of callback below. 

RMDP_SERVICE_HAP 

The name of Services Directory to be used in lieu of 
the standard directory. 

RMDPJ3ROUP 

20 The user group used to determine the appropriate 

server list* Should be prefixed with '+'. Default is 
group is »+" (i.e. the null group) . 



RMDP_RETRY_TIME 

The number of seconds that the client will wait 
25 between successive retries to the same server, e.g., in 
the case of cache full." Default is 30. 

RMDP_QUIET_TIM£ 

The time in seconds that a stream may be "quiet" 
before the protocol assumes that the server has died and 
30 initiates a "hunt" for a different server. Default is 75. 
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RMDP_VERIFY_TIME 

The time in seconds between successive pings of the 
server by the client. Default is 60. 

RMDP_APP_NAME 

The name of the application i.e. "telerate" , 
"reuters" etc. If this property is set, then the relevant 
entries from the Service Directory will be cached. 

void 

rmdp_lnit(); 

This initializes the internal data structures and 
must be called prior to any calls to rmdpJ3et(). 



RStream 

rmdpJ3et< service, request, host) 
char *service, *request, *host; 

15 This is used to get a stream of data for a particular 

'service' and subscription 'request'. For the standard 
market data services, the request will be the name of a 
page (e.g., n 5 w , n AANN w ) • If 'host' is non-NULL, then the 
RMDP will only use the server on the given host. In this 

20 case, no reconnection to alternative servers will be at- 
tempted upon a server failure. If 'host' is NULL, then 
RMDP will consult the Tlfl 1 " Services Directory to identify 
a list of server alternatives for the request, 'rstream' 
is an opaque value that is used to refer to the stream. 

25 All data passed to the application's callback function 
will be identified by this value. 

An error is indicated by RStream rstream — NULL. 
RStream 

rmdp_Derive (rstream, op) 

30 RStreamold; 
char *op; 

This generates a new subscription and, hence, a new 
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' Rstream' from an existing subscription, 'command' is a 
string sent to the server, where it is interpreted to 
determine the specific derivation. 

The standard market data servers understand the 
5 following commands: "n" for next-page , w p" for 
previous-page and "t XXXX" for time-page. 

Derived streams cannot be recovered in the case of 
server failure. If successful, an Rstream is returned, 
otherwise NULL is returned. 

void 

rmdp_Message (rstream, msg) 
RStreamrstream ; 
char *msg ; 

Sends the string 'msg' to the server used by 
'rstream'. The messages are passed directly to the 
server, and are not in any way affected by the state of 
the stream. The messages are understood by the standard 
market data servers include w rr <PAGE NAME>" to rerequest 
a page, and w g a" to request the server's network address. 
Some messages induce a response from the server (such as 
queries) . In this case, the response will be delivered to 
all streams that are connected to the server. 

void 

rmdp_Halt (rstream) 

2 5 RStreamrstream ; 

This gracefully halts the 'rstream'. 

void 

callback (rstream, msgtype, msg, act, err) 
RStreamrstream; 
30 mdp_msg_t msgtype; 
char *msg; 
mdpjact_t act; 
mdp_err_ t err ; 

This is the callback function which was registered 
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with rmdp_SetProp(RMDP_CALLBACK, callback). 'rstream' is 
the stream to which the message pertains, 'msgtype' can 
be any of the values defined below (see "RMDP Message 
Type") . 'msg' is a string which may contain vtlOO 
5 compatible escape sequences, as in MDSS. (It will NOT 
however be prefaced with an A [[...E. That role is assumed 
by the parameter 'msgtype' . ) 

The last two parameters are only meaningful if 
'msgtype' is MDP_MSG_STATUS . 'act' can be any of the 
10 values found in "RMDP Action Type" (see below), but 
special action is necessary only if 

act = ' MDP_ACT_CANCEL ' . The latter indicates that the 
stream is being canceled and is no longer valid. It is up 
to the application to take appropriate action. In either 
15 case, 'err' can be any of the values found in "RMDP Error 
Type" (see below) , and provides a description of the 
status. 



RMDP Message Types (mdp_msg_t) 

The message types are listed below. These types are 
20 defined in the underlying (unreliable) Market Data 
Protocol (MDP) and are exported to the RMDP. 



MDP_MSG_BAD » -1 
MDP_MSG_DATA = 0 
MDP_MSG_STATUS » 1 
25 MDP_MSG_OOB - 2 

MDP MSG QUERY = 3 



Page data message. 

Status/error message. 

"Out of Band" message, e.g., 

time stamp. 
Query result. 



RMDP Action Type (mdp_act_t) 

The action types are listed below. These action 
30 types inform the RMDP clients of activities occurring in 
the lower level protocols. Generally speaking, they are 
"for your information only" messages, and do not require 
additional actions by the RMDP client. The exception is 
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the " MDP_ACT_CANCEL " action, for which there is no 
recovery. These types are defined in the underlying 
(unreliable) Market Data Protocol (HDP) and are exported 
to the RMDP. 

5 MDP_ACT_OK =0 No unusual action required. 

MDP_ACT_CANCEL = 1 The request cannot be serviced, 

cancel the stream, do not 
attempt to reconnect. (E.g., 
invalid page name.) 
10 MDP_ACT_CONN_FIRST « 2 The server is closing the 

stream; the first server in the 
alternatives list is being 
tried. (E.g., the server is 
shedding "extra" clients for 
15 load balancing.) 

MDP_ACT_CONN_NEXT - 3 The server is closing the 

stream; the next server in the 
alternatives list is being 
tried. (E.g., the server's line 
20 to host fails.) 

MDP_ACT_LATER = 4 Server cannot service request at 

this time; will resubmit request 
later, or try a different 
server. (E.g., Cache full.) 
25 MDP_ACT_RETRY = 5 Request is being retried 

immediately. 

RMDP Error Types (mdp_err_t) 

Description of error, for logging or reporting to end 
user. These types are defined in the underlying (unreli- 
30 able) Market Data Protocol (MDP) and are exported to the 
RMDP. 



MDP_ERR_0K = 0 
MDP_ERR_LOW - 1 
MDP ERR QUIET = 2 
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MDP_ERR_INVAL = 3 
MDP_ERR_RESRC = 4 
MDP^ERR^ INTERNAL = 5 
MDPJBRR_DELAY = 6 
MDP_ERR_SYS = 7 
MDP_ERR_COMM « 8 

4. Subject-Addressed subscription Services TIBINFO 



4.1 Introduction 

TIBINFO defines a programmatic interface to the 

10 protocols and services comprising the TIB™ subcomponent 
providing Subject-Addressed Subscription Services (SASS) - 
The TIBINFO interface consists of libraries: 
TIBINFO_CONSUME for data consumers, and TIBINFO_PUBLXSH 
for data providers. An application includes one library 

15 or the other or both depending on whether it is a consumer 
or provider or both. An application can simultaneously be 
a consumer and a producer. 

Through its support of Subject-Based Addressing, 
TIBINFO supports a information-oriented model of 

20 cooperative processing by providing a method for consumers 
to request information in a way that is independent of the 
service (or services) producing the information. 
Consequently, services can be modified or replaced by 
alternate services providing equivalent information 

25 without impacting the information consumers. This 
decoupling of information consumers from information 
providers permits a higher degree of modularization and 
flexibility than that permitted by traditional 
service-oriented processing models. 

30 For Subject-Based Addressing to be useful in a real 

time environment, it must be efficiently implemented. 
With this objective in mind, support for Subject-Based 
Addressing has been built into the low levels of the 
Distributed Communications Component. In particular, the 
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filtering of messages by subject is performed within the 
TIB™ daemon itself. 

4 • 2 Concepts 

Subject 

5 The subject space is hierarchical* Currently, a 

4 -level hierarchy is supported of the following format: 

major [ . minor [ .qualif ierl[ .qualif ier2] ] ] 

where * [ ' and * ] ' are metacharacters that delimit an 
optional component, major, minor, qualif ierl and 

10 qualifier 2 are called subject identifiers. A subject 
identifier is a string consisting of the printable ascii 
characters excluding *?', and A subject identi- 

fier can be an empty string, in which case it will match 
with any subject identifier in that position. The 

15 complete subject, including the separators, cannot 
exceed 32 characters. Subjects are case sensitive. 

Some example of valid subjects are listed below: The 
comments refer to the interpretation of subjects on the 
consume side. (The publish-side semantics are slightly 

20 different.) 

equity . ibm . compos it e . quote 

equity. .composite. quote matches any minor subject 

equity. ibm matches any qualif ierl and 

qualif ier2 

25 equity. ibm. same as above 

Within the TIBINFO and the SASS, subjects are not 
interpreted. Hence, applications are free to establish 
conventions on the subject space. It should be noted that 
SASS components first attempt to match the major and minor 
30 subject identifiers first. As a consequence, although 
applications can establish the convention that 
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"equity.ibm" and w . .equity. ibm" are equivalent subjects, 
subscriptions to "equity. ibm" will be more efficiently 
processed. 

Stream 

5 A stream is an abstraction for grouping 

subscriptions. The subscriptions on a stream share a 
common set of properties, notably the same message handler 
(i.e., "callback" routine) and the same error handler. 
All subscriptions on a stream can be "canceled" simply by 
10 destroying the stream. 

A stream imposes little overhead on the system. They 
can therefore be freely created and destroyed. 

Protocol Engines, Service Disciplines, and Subject Mappers 

The SASS and DCC components implement many support 

15 services in order to provide the functionality in TIBINFO. 
These include subject mappers for efficiently handling 
subjects, service disciplines for controlling the inter- 
action with servers, and protocol engines for implementing 
reliable communication protocols. TIBINFO provides an 

20 interface for setting properties of these components. 
Hence, by setting the appropriate properties, one can 
specify, for example, the behavior of the subject mapper 
through the TIBINFO interface. Since these properties are 
in configuration files, configuration and site dependent 

25 parameters can be altered for the above components by the 
system administrator through TIBINFO. 

In some embodiments, the property definitions for 
TIBINFO and for the underlying components may be augmented 
to support enhancements. This use of properties yields 

30 flexibility and extensibility within the confines of a 
stable functional interface. 

4 . 3 Description 

The TIBINFO interface is high-level and easy to use. 
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Published data can be a form or an uninterpreted byte 
string. Messages can be received either in a synchronous 
fashion, or in an asynchronous fashion that is suitable 
for event-driven programming. The following functions are 
5 sufficient to write sophisticated consumers using 
event -driven programming. 

Tib_streaa *tib_consume__create (property-list, tibjbop) 

Creates a TIBINFO stream that supports multiple 
subscriptions via the "subscribe" function. The 
10 property_list is a (possibly empty) list of property value 
pairs, as illustrated by 

tib_consume_create (TIB_PROP_MSGHANDLER , my_handler , 

TIB_PROP_ERRHANDLER, my_err_handler , TIB_E0P) ; 

Valid properties are defined below. TIB_EOP is a 
15 literal signaling the end of the property list. 

void tibjies troy (stream) 

Tib_stream *stream; 

Reclaims resources used by the specified stream. 

?ib_erxorcode tib_subscribe (stream, subject, client data) 

20 Tib_stream *stream; 
Tib_subject *subject; 
caddr_t clientdata; 

Informs the TIB« software that the client application 
is interested in messages having the indicated subject. If 
25 stream has an associated "message-handler," then it will 
be called whenever a message satisfying the subscription 
arrives. Qualifying messages are delivered on a 
first-in/first-out basis. The value of clientdata is 
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returned in every message satisfying the subscription 
subject. Note that multiple subscriptions to the same 
subject on the same stream are undefined. 

void tib_cancel (stream) 

5 Tib_stream *stream ; 

Cancels the client application's subscription to the 
specified subject. 

void my_ message^ handler ( stream , msg) 

Tib_stream *stream; 

10 Tib_message *message; 

This is the "callback 11 function that was registered 
with the stream. Forms are returned unpacked. The func- 
tion can reference the entire message structure through 
the macros described below. 

15 The following functions are sufficient to write 

producers. Two publishing functions are provided to 
support the different data types that can be transmitted 
through the TIB-INFO interface. 

tib_publish_oreate (property-list, TIB_E0P) 
20 Is used to create an TIBINFO stream for publishing 

records. The property_list is a (possibly empty) list of 
property-value pairs, as illustrated by 

tibjpublish_create (TIB_PROP_ERRHANDLER, myjiandler , TIB_EOP) 
* 

9 

25 Valid properties are defined below. TIB__EOP is a 

constant signaling the end of the property list. 

tib_des troy (stream) 

Tib_stream stream; 

Reclaims resources used by the specified stream. 
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Tiberrorcode tibj>ublish_f orm (stream, subject, form) 

Tib_stream *stream; 
Tib_subject *subject; 
Forn form; 

5 Accepts a single, unpacked form, packs it, and 

publishes it. 

Tiberrorcod. tibjublis1»_buffer< stream, subject, length, 
form) 

Tib_stream *stream; 
10 Tib_subject *subject; 
short length; 
Foro form; 

Accepts a byte buffer of specified length and 
publishes it. 

15 The remaining functions are control functions that 

apply to both the consume and the publish side. 

void Tib_bateh() 

This may be used prior to initiating multiple 
subscriptions, it Informs the TIB™ library that it can 
delay acting on the subscriptions until a tib unbatch is 
seen. This allows the TIB« library to attempt to optimize 
the execution of requests. Mote that no guarantees are 
made about the ordering or timing of -batched- request. 
In particular, (i) requests may be executed prior to the 
receipt of the tib_unbatch function, and (ii) the effects 
of changing properties in the middle of a batched sequence 
of requests is undefined. Batch and unbatch requests may 
be nested. (Note that the use of tibbatch is completely 
optional and it does not change the semantics of a correct 
30 program.) 



20 



25 
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Tib_errorcode tito_stream_set( stream, property, value) 

Tib stream *stream; 
Tibjproperty *property ; 
caddr_t value; 
5 Used to change the dynamically settable properties of 

a stream. These properties are described below. Note 
that some properties can only be set prior to stream 
creation (via tib_default_set) or at stream creation. 

caddr_t tib_stream_get (stream, property) 

10 Tib_stream *stream; 
Tib ^property *property ; 

Used to retrieve the current value of the specified 

property. 

Tib_errorcode tito_default_set( property, value) 

15 Tib_stream *stream; 

Tibjproperty *property ; 

caddr_t value; 

Used to change the initial properties of a stream. 

During stream creation, the default values are used as 
20 initial values in the new stream whenever a property value 

is not explicitly specified in the creation argument list. 



Tib_errorcode tib_def ault_get ( property) 

Tib_stream *stream ; 

Tib jproperty *property ; 

Used to retrieve the default value of the specified 

property. 

tib_unbatch() 

Informs TIBINFO to stop "batching" functions and to 
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execute any outstanding ones. 

TIB INFO Attributes 

The properties defined by TIBINFO and their allowable 
values are listed below and are described in detail in the 
5 appropriate "man" pages. The last grouping of properties 
allow the programmer to send default property values and 
hints to the underlying system components-specif ically, 
the network protocol engines, the TIB™ subject mapper, and 
various service disciplines. 

cfile-handle 
pointer 

error-handler-routine 
tib_message pointer 
message-handler-routine 
protocol-engine-property-list 
protocol-engine-property-cfile 
service-discipl ine -property- 1 ist 
service-discipline-property-cfile 
sub j ect-property-list 
sub j ect-property-cf ile 

TIBINFO Message structure 

The component information of a TIBINFO message can be 
accessed through the following macros: 

tib_msg_clientdata (msg) 
25 tib__msg_sub ject (msg) 
tib_msg_size(m&g) 
tib_msg_value (msg) 

The following macros return TRUE (1) or FALSE (0) : 

tib_msg_is_buf f er (msg) 
30 tib_msg_is_form (msg) 



10 TIB_PROP_CFILE 

TIBJPROP_CLIENTDATA 
TIBJ?ROP_ERRHANDLER 
TIB_PROP_LASTMSG 
TIB_PROP_MSGHANDLER 

15 TIB_PROP_NETWORK 

TIB_PROP NETWORK_CFILE 
TIB_PROP_SERVICE 
TIB_PROP_SERVICE_CFILE 
TIB_PROP_SUBJECT 

20 TIB PROP SUBJECT CFILE 
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5. TIB™ Forms 

5 . 1 Introduction 

The Forms package provides the tools to create and 
manipulate self -describing data objects, e.g., forms. 
5 Forms have sufficient expressiveness, flexibility and ef- 
ficiency to describe all data exchanged between the dif- 
ferent TIB™ applications, and also between the main 
software modules of each application. 

The Forms package provides its clients with one data 

10 abstraction. Hence, the software that uses the Forms pack- 
age deal with only one data abstraction, as opposed to a 
data abstraction for each different type of data that is 
exchanged. Using forms as the only way to exchange user 
data, facilitates (i) the integration of new software 

15 modules that communicate with other software modules, and 
(ii) modular enhancement of existing data formats without 
the need to modify the underlying code. This results in 
software that is easier to understand, extend, and 
maintain. 

20 Forms are the principal shared objects in the TIB™ 

communication infrastructure and applications; 
consequently, one of the most important abstractions in 
the TIB™. 

The primary objective in designing the forms package 

25 were: 

o Extensibility - It is desirable to be able to change 
the definition of a form class without recompiling the 
application, and to be able introduce new 
classes of forms into the system. 

30 o Maintainability - Form-class definition changes may 
affect many workstations; such changes must be propagated 
systematically . 
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o Expressiveness - Forms must be capable of describing 
complex objects; therefore, the form package should 
support many basic types such as integer, real, 
string, etc. and also sequences of these types. 

5 o Efficiency - Forms should be the most common object 
used for sending information between processes-both for 
processes on the same workstation and for processes on 
different workstations. Hence, forms should be designed to 
allow the communication infrastructure to send information 
10 efficiently. 

Note that our use of the term "form" differs from the 
standard use of the term in database systems and so-called 
"forms management systems." in those systems, a "form" 
is a format for displaying a database or file record. 

15 (Typically, in such systems, a user brings up a form and 
paints a database record into the form.) 

Our notion of a form is more fundamental, akin to 
such basic notions as record or array. Our notion takes 
its meaning from the original meaning of the Latin root 

20 word forma . Borrowing from Webster: "The shape and 
structure of something as distinguished from its 
material". Forms can be instantiated, operated on, passed 
as arguments, sent on a network, stored in files and data* 
bases. Their contents can also be displayed in many dif- 

25 ferent formats, "templates" can be used to specify how a 
form is to be displayed. A single form (more precisely, a 
form class) can have many "templates" since it may need to 
be displayed in many different ways. Different kinds of 
users may # for example, desire different formats for 

30 displaying a form. 

5.2 Description 

Forms are self -describing data objects. Each form 
contains a reference to its formclass, which completely 
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describes the form. Forms also contains metadata that 
enables the form package to perform most operations 
without accessing the related formclass definition. 

Each form is a member of a specific form class. All 
5 forms within a class have the same fields and field's 
labels (in fact, all defining attributes are identical 
among the forms of a specific class) . Each form class is 
named and two classes are considered to be distinct if 
they have distinct names (even though the classes may have 

10 identical definitions). Although the forms software does 
not assign any special meaning or processing support to 
particular form names, the applications using it might. 
(In fact, it is expected that certain form naming 
conventions will be established.) 

15 There are two main classification of forms: primitive 

versus constructed forms, and fixed length versus variable 
size forms. 

Primitive forms are used to represent primitive data 
types such as integers, float, strings, etc. Primitive 

20 forms contain metadata, in the form header information 
header and the data of the appropriate type, such as 
integer , string , etc. 

Constructed forms contain sub-forms. A constructed 
form contains other forms, which in turn can contain 

25 subforms. 

Fixed length forms are simply forms of a fixed 
length, e.g., all the forms of a fixed length class occupy 
the same number of bytes. An example for a fixed length 
primitive form is the integer form class; integer forms 

30 always take 6 bytes, (2 bytes for the form header and 4 
bytes for the integer data) . 

Variable size forms contain variable size data: 
variable size, primitive forms contain variable size data, 
such as variable length string; variable size, constructed 

35 forms contain a variable number of subforms of a single 
class. Such forms are similar to an array of elements of 
the same type. 
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5.3 class Identifiers 

When a class is defined it is assigned an identifier. 
This identifier is part of each of the class's form 
instance, and is used to identify the form's class. This 
5 identifier is in addition to its name. Class identifiers 
must be unique within their context of use. Class 
identifiers are 2 bytes long; bit 15 is set if the class 
is fixed length and cleared otherwise; bit 14 is set if 
the class is primitive and cleared otherwise; 

10 5.4 Assignment semantics 

To assign and retrieve values of a form (or a form 
sequence) , "copy" semantics is used. Assigning a value to 
a form (form field or a sequence element) copies the value 
of the form to the assigned location-it does not point to 

15 the given value. 

Clients that are interested in pointer semantics 
should use forms of the basic type Form Pointer and the 
function Form_set_data_pointer. Forms of type Form Pointer 
contain only a pointer to a form; hence, pointer semantics 

20 is used for assignment. Note that the C programming 

language supports pointer semantics for array assignment. 

5.5 Referencing a Form Field 

A sub-form or field of a constructed form can be 
accessed by its field name or by its field identifier (the 

25 latter is generated by the Forms package) . The name of a 
subform that is not a direct descendant of a given form is 
the path name of all the fields that contain the rec asted 
subform, separated by dot. Note that this is similar to 
the naming convention of the C language records. 

30 a field identifier can be retrieved given the field's 

name and using the function Form-class^get^field^id. The 
direct fields of a form can be traversed by using 
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Form_field_id_f irst to get the identifier of the first 
field, and then by subsequently calling Form_f ield_id_next 
to get the identifiers of each of the next fields. 
Accessing a field by its name is convenient; 
5 accessing it by its identifier is fast. Most of the Forms 
package function references a form field by the field's 
identifier and not by the field's name. 

5.6 Form-class Definition Language 

Form classes are specified using the n form-class 
10 definition language," which is illustrated below. Even 
complex forms can be described within the simple language 
features depicted below. However, the big attraction of a 
formal language is that it provides an extensional 
framework: by adding new language constructs the descrip- 
15 tive power of the language can be greatly enhanced without 
rendering previous descriptions incompatible. 

A specification of a form class includes the 
specification of some class attributes, such as the class 
name, and a list of specifications for each of the class's 
20 fields. Three examples are now illustrated: 

{ 

short { 

IS_FIXBD true; 
IS_PRIMITIVE true ; 

25 DATA_SIZE 2; 

DATA_TYPE 9; 
> 

short_array { # A variable size class of shorts. 

IS_FIXED false; 
30 FIELDS { 

{ 

FIELD_CLASS_NAME short; 

} 

}} 
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example_class { 

IS_FIXED true; 
FIELDS { 
first { 

5 FIELD_CLASS_NAME short; 

} 

second { 

FIELD_CLASS_NAME short_array; 

} 

10 third { 

FIELD_CLASS_NAME string_30; 

} 

fourth { 

FIELD_CLASS_NAME integer; 

15 } 
} 

}} 

To specify a class, the class's name, a statement of 
fixed or variable size, and a list of fields must be 
20 given. For primitive classes the data type and size must 
also be specified. All the other attributes may be left 
unspecified, and defaults will be applied. To define a 
class field, either the field class name or id must be 
specify. 

25 The form-class attributes that can be specified are: 



o The class name. 



o CLASS_ID - The unique short integer identifier of 
the class. Defaults to a package specified value. 

o IS_FIXED - Specifies whether its a fixed or 
30 variable 

size class. Expects a boolean value. This is a 
required attribute. 
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o IS_PRIMITIVE - Specifies whether its a primitive 
or 

constructed class. Expects a boolean value* 
Defaults to False. 

5 o FIELDS_NUM - An integer specifying the initial 
number of fields in the form. Defaults to the 
number of specified fields. 

o DATAJTYPE - An integer r specified by the clients, 
that indicates what is the type of the data. Used 
10 mainly in defining primitive classes. It does not 

have default value. 

o DAT ASSIZE - The size of the forms data portion. 
Used 

mainly in defining primitive classes. It does not 
15 have default value. 

o FIELDS - Indicates the beginning of the class's 
fields definitions. 

The field attributes that can be specified are: 

o The class field name. 

20 o FIELD_CLASS_ID - The class id for the forms to 

reside in the field. Note that the class name can 



be 



used for the same purpose. 



o FIELD__CLASS_NAME - The class name for the forms 
25 to 

reside in the field. 



Here is an example of the definition of three 
classes: 
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Note that variable length forms contains fields 
of a single class, "integer" and "string_30 n , used in 
the above examples, are two primitive classes that are 
defined within the Formclass package itself. 

5 5.7 Form Classes are Forms 

Form classes are implemented as forms. This means 
functions that accept forms as an argument also accept 
form classes. Some of the more useful functions on form 
classes are: 

10 o Form_pack, Formjinpack - Can be used to pack 

and 

unpack form classes, 
o Form_copy - Can be used to copy form classes, 
o Form_show - Can be used to print form 
15 classes. 

5.8 Types 

typedef Formclass 

A form-class handle. 

typedef Fonnelass_id 

20 a form-class identifier. 

typedef Foraelass_attr 

A form-class attribute type. Supported 
attributes are: 

o FORMCLASS_SIZE - The size of the class form 
25 instances. 

o FOKMCLASS_NAME - The class name. 



o 



FORMCLASS_ID - A two byte long unique identifier. 
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o FORMCLASS_FIELDS_NUM - The number of (direct) 

fields in the class. This is applicable only for 
fixed length classes. The number of fields in a 
variable length class is different for each 
5 instance; hence, is kept in each form instance. 

o FORMCLASS_INSTANCES_NUM - The number of form 
instances of the given class. 



o FORMCLASS_IS_FIXED - True if its a fixed length 
form, False if its a variable length form. 

10 o FORMCLASS_IS_PRIMITIVE - True if its a form of 

primitive type, False if its a constructed form, 
i.e. the form has sub forms. 

o FORMCLASS_DATA_TYPE - This field value is 

assigned 

15 by the user of the forms a to identify the data 

type of primitive forms. In our current 
application 

we use the types constants as defined by the 
enumerated type Form_data_type , in the file 

20 forms, h. 

o FORMCLASS_DATA_SIZE - This field contains the 
data 

size, in bytes, of primitive forms. For 
instance, 

25 the data size of the primitive class Short is 

two. 

Because it contains the C type short, which is 

kept 

in two bytes. 
30 typedef Formclass_f ield__attr 
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A form-class field attribute type. Supported form 
class field attributes are: 



o FORMCLASS_FIELD_NAME - The name of the class 
field. 

5 o FORMCLASS_FIELD_CLASS_ID - The class id of 

the 

field's form. 



typedef Form 

A form handle. 

typedef Form_field__id 

An identifier for a form's field, it can 
identifies 

fields in any level of a form. A field 
identifier 

can be retrieved form a field name, using the 
function Form-class_get_field_name. A 
fonn_fiald_id ~ 

is manipulated by the functions: 
Form_field_id_first and Form_field_id_next. 

typedef Form_attr 

A form attribute type. Supported form attributes 

are: 

o FORM_CLASS_ID - The form's class identifier. 



o FORM_DATA_SIZE - The size of the form's data. 
Available only for constructed, not 

primitive, 

forms or for primitive forms that are of 
variable size. For fixed length primitive 

forms 

this attribute is available via the form 
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class . 

o FORMJFTELDS_NUM - The number of fields in the 
given form. Available only for constructed, 

not 

5 primitive forms. For primitive forms this 

attribute is available via the form class. 

typedef Form_data 

The type of the data that is kept in primitive 

forms. 

10 typedef Form_pack_format 

Describes the possible form packing types. 
Supported packing formats are: 

o FORM__PACK_LIGHT - Light packing, used mainly 

for 

15 inter process communication between processes 

on 

the same machine. It is more efficient then 
other types of packing. Light packing 

consists 

20 of serializing the given form, but it does 

not 

translates the form data into machine 
independent format. 

o FOKM_PAGK_XDR - Serialize the form while 
25 translating the data into a 

machine- independent 

format* The machine- independent format used 

is 

Sun's XDR. 



30 5.9 Procedural Interface to the Forms-class Package 
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The forme lass package is responsible for creating 
and manipulating forms classes. The forms package uses 
these descriptions to create and manipulate instances 
of given form classes. An instance of a form class is 
called, not surprisingly, a form. 

Formclass_create 

Create a class handle according to the given 
argument list. If the attribute CIASSJZFILE is 
specified it should be followed by a cfile handle 
and a path_name. In that case formclass_create 
locates the specification for the form class in 

the 

specified configuration file. The specification 

is 

15 compiled into an internal data structure for use 



10 



20 



by 

data 
class 



the forms package. 

Formclass_create returns a pointer to the class 
structure. If there are syntax errors in the 



description file the function sets the error 
message 

flag and returns NULL. 

25 Formalass_destroy 

The class description specified by the given 

class 

handle is dismantled and the storage is 
reclaimed. 

30 If there are live instances of the class then the 

class is not destroyed and the error value is 
updated to 
" FORMCLAS S_ERR_NON_Z ER0_I NSTANCES_NUM W . 

Foraclass_get 
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Given a handle to a form class and an attribute 
of a class (e.g. one of the attributes of the 
type Formclass_attr) Formclass_get returns the 
value of the attribute. Given an unknown 
attribute the error value is updated to 
"FORMCLASS_ERR_UNKNOWN_ATTRI-BUTE" . 

Formclass_get_handle_by_id 

Given a forms-class id, 

Formclass_get_handle_by_id returns the handle to 
the appropriate class descriptor. If the 
requested class id is not known 
Form-class_get_handle_by_id returns NULL, but 
does not set the error flag. 

Formclas s_ge t_feandle_by jname 

Given a forms-class name, 

Formclass_get_handle_by_ name returns the handle 
to the appropriate class descriptor. If the 
requested class name is not known 
Foraclass_getJiandle_by_name returns NULL, but 
does not set the error flag. 

Formclass_get_f ield_id 

Given a handle to a form class and a field name 
this function returns the form id, which is used 
for a fast access to the form. If the given 
field name does not exist, it updated the error 
variable to FORMCIASS_ERR_UNKNOWN_FIEU)_NAME . 

Formclass_f ield_get 

Returns the value of the requested field's at- 
tribute. If an illegal id is given this 
procedure, it updated the error variable to 
FORMCLASS_ERR_ UNKNOWN_FIELD_ID . 

Formelass iserr 
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Returns TRUE if Formclass error flag is turned 
on, FALSE otherwise. 

Formclass_errno 

Returns the formclass error number. If no error, 
it returns FORMCLASS_OK. For a list of supported 
error values see the file formclass, h. 



5.10 The Forms Package 
Form_oreate 

Generate a form (i.e., an instance) of the form 
10 class specified by the parameter and return a 

handle to the created form. 

Fo rm__d e s tr oy 

The specified form is "destroyed" by reclaiming 
its storage. 

15 Form_get 

Given a handle to a form and a valid attribute 
(e.g. one of the values of the enumerated type 
Form_attr) Form_get returns the value of the 
requested attribute. 

20 The attribute FORMJDATA_SIZE is supported only 

for variable size forms. For fixed size form 
this information is kept in the class description 
and is not kept with each form instance. 
Requiring the FORM_DATA_SIZB from a fixed length 

25 form will set the error flag to 

FORM_ERR_NO_S I Z B_ATTR_FOR_FIXED_ LENGTHJFORM. 

The attribute FORM_FIELDS_NUM is supported only 
for constructed forms. Requiring the 
FORM_FIELDS_NUM from a primitive form will set 
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the error flag to 

FORM_EKR_ILLEGAL_ATTR_FOR_PRIMITIVE_FORM. 

If the given attribute is not known the error 
flag is set to FORM_ERR_UNKNOWN_ATTR . When the 
5 error flag is set differently, then FORM_OK 

Form_get returns NULL. 



Form_set_data 

Sets the form's data value to the given value. 
The given data argument is assumed to be a 
10 pointer to the data, e.g., a pointer to an 

integer or a pointer to a date structure. 
However for strings we expect a pointer to a 
character . 



Note that we use Copy semantics for assignments. 

15 Form_get_data 

Return a pointer to form's data portion. In case 
of a form of a primitive class the data is the an 
actual value of the form's type. If the form is 
not of a primitive class, i.e., it has a non zero 

20 number of fields, then the form's value is a 

handle to the form's sequence of fields. 

Warning, the returned handle points to the form's data 
structure and should not be altered. If the returned 
value is to be modified is should be copied to a 
25 private memory. 



Foxa_set_data_pointer 

Given a variable size form, Form_set_data_j>ointer 
assigns the given pointer to the points to the 
forms data portion. Form_setjiata_pointer 
30 provide a copy operation with pointer semantics, 

as opposed to copy semantics. 
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If the given form is a fixed length form then the 
error flag is set to 

FORM_ERR_CANT_ASSIGN_POINTER_ TO_FIXED_FORM. 

Form_field_3et_data 

This is a convenient routine that is equal to 
calling Form_f ield_get and then using the 
retrieved form to call Form_set_data. More 
precisely: foru_ f ield_set_data(form, field_id, 
form_data, size) =* 

form_set_data(form_f ield_get(f orm, field_id) , 
form_ data, size), plus some error checking. 

Fona__field_get_data 

Note that we use Copy semantics for assignments. 

This is a convenient routine that is equal to 
calling Form_f ield_get and then using the 
retrieved form to call Form_get_data. More 
precisely: form_ f ield_get_data(form, field_id, 
form_data, size) — ■ 

form_get_data(fora_field_get(form, field_id) , 

formjiata, size) plus some error checking. 

Warning, the returned handle points to the form's data 
structure and should not be altered, if the returned 
value is to be modified is should be copied to a 
private memory. 



15 



20 



25 fora_field_id_first 

Form_field_id_first sets the given field_id to 
identify the first direct field of the given form 
handle. 

Note that the memory for the given f ield_id 
30 should be allocated (and freed) by the clients of 
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the forms package and not by the forms package. 

f orm_f ield_id_next 

Form_f ield_id_first sets the given f ield_id to 
identify the next direct field of the given form 
5 handle. Calls to Form_f ield_id_next must be 

preceded with a call to Form_f ield_id_f irst. 

Note that the memory for the given field_id 
should be allocated (and freed) by the clients of 
the forms package and not by the forms package. 

10 Form_f ield_set 

Sets the given form or form sequence as the given 
form field value. Note that we use Copy 
semantics for assignments. 

When a nonexistent field id is given then the 
15 error flag is set to FORM_ERR_ILLEGAL_ID. 

Form_f ield_get 

Return's a handle to the value of the requested 
field. The returned value is either a handle to a 
form or to a form sequence. 

20 Warning, the returned handle points to the form's 

data structure and should not be altered. If the 
returned value is to be modified is should be 
copied to a private memory, using the Form_copy 
function. 



25 When a nonexistent field id is given, then the 

error flag is set to FORM_ERR_ILLEGAL_ID and 
Form_f ield_get returns NULL* 

Form_f ield_append 

Form_field_append appends the given, append_form 
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argument to the end of the base_form form 
sequence. Form_f ield_append returns the id of 
the appended new field. 

Form_field_delete 

5 Form_f ield_delete deletes the given field from 

the given base_form. 

If a non existing field id is given then the 
error flag is set to FORM_ERR_ILLEGAL_ID and 
Form_field_ delete returns NULL. 

10 Formjpack 

Form_pack returns a pointer to a byte stream that 
contains the packed form, packed according to the 
requested format and type* 

If the required packed type is FORM_PACK_LIGHT 
then Formjpack serializes the form, but the forms 
data is not translated to a machine-independent 
representation. Hence a lightly packaged form is 
suitable to transmit between processes on the 
same machine. 

If the required packed type is FORM_PACK_XDR then 
Formj>ack serializes the form and also translates the 
form representation to a mac w ine-independent 
representation, which is Sun's XDR. Hence form packed 
by an XDR format are suitable for transmitting on a 
network across machine boundaries. 

Formclass.h. are implemented as forms, hence Form_pack 
can be used to pack form classes as well as forms. 

Forajinpack 

Given an external representation of the form, 
create a form instance according to the given 



20 



25 
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class and unpack the external representation into 
the instance. 

Form classes are implemented as forms, hence 
Formjinpack can be used to unpack form classes 
5 as well as forms. 

Fona_copy 

Copy the values of the source form into the 
destination form. If the forms are of different 
classes no copying is performed and the error 
10 value is updated to F0RM_ERR_I LLEG AL_CLAS S . 

Formclasses are implemented as forms, hence 
Form__copy can be used to copy form classes as 
well as forms. 

Formjshow 

15 Return an ASCII string containing the list of 

field names and associated values for indicated 
fields. The string is suitable for displaying on 
a terminal or printing (e.g., it will contain 
new-line characters) . The returned string is 

20 allocated by the function and need to be freed by 

the user. (This is function is very useful in 
debugging.) 

Formclasses are implemented as forms, hence 
Form_show can be used to print form classes as 
25 well as forms. 

ForaJLsarr 

Returns TRUE if the error flag is set, FALSE 
otherwise. 

Form_errno 

30 Returns the formclass error number. If no error, 
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it returns FORMCLASS_OK. The possible error 
values are defined in the file forms. h. 

There follows a list of definitions of some of 
the words and phrases used to describe the invention. 

Access P rocedure : a broader term than service 
discipline or service protocol because it encompasses 
more than a communications protocol to access data from 
a particular server, service, application. It includes 
any procedure by which the information requested on a 
particular subject may be accessed. For example, if 
the subject request is "Please give me the time of 
date", the access procedure to which this request is 
mapped on the service layer could be a call to the 
operating system on the computer of the user that 
Vitiated the request. An access procedure could also 
involve a call to a utility program. 

Application; A software program that runs on a computer 
other than the operating system programs. 

Architectural Decouples a property of a system using 
the teachings of the invention. This property is 
inherently provided by the function of the information 
layer in performing subject-based addressing services 
in mapping subjects to services and service disciplines 
through which information on these subjects may be 
obtained. Subject-based addressing eliminates the need 
for the data consuming processes to know the network 
architecture and where on the network data on a 
particular subject may be found. 



10 



15 



Attribute of a F orm Class ! A property of form class 
such as whether the class is primitive or constructed. 
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Size is another attribute. 

Class ; A definition of a group of forms wherein all 
forms in the class have the same format and the same 
semantics . 

5 Class /Class Descriptor /Class Definition ; A definition 

of the structure and organization of a particular group 
of data records or "forms 11 all of which have the same 
internal representation, the same organization and the 
same semantic information. A class descriptor is a 

10 data record or "object" in memory that stores the data 
which defines all these parameters of the class 
definition. The Class is the name of the group of 
forms and the Class Definition is the information about 
the group's common characteristics. Classes can be 

15 either primitive or constructed. A primitive class 
contains a class name that uniquely identifies the 
class (this name has associated with it a class number 
or class_id) and a specification of the representation 
of a single data value. The specification of the 

20 representation uses well known primitives that the host 
computer and client applications understand such as 
string_20 ASCII, floating point, integer, string_20 
EBCDIC etc. A constructed class definition includes a 
unique name and defines by name and content multiple 

25 fields that are found in this kind of form. The class 
definition specifies the organization and semantics or 
the form by specifying field names. The field names 
give meaning to the fields. Each field is specified by 
giving a field name and the form class of its data 

30 since each field is itself a form. A field can be a 
list of forms of the same class instead of a single 
form. A constructed class definition contains no 
actual data although a class descriptor does in the 
form of data that defines the organization and 

35 semantics of this kind of form. All actual data that 
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def ine instances of forms is stored in forms of 
primitive classes and the type of data stored in 
primitive classes is specified in the class definition 
of the primitive class. For example, the primitive 
class named "Age" has one field of type integer_3 which 
is defined in the class definition for the age class of 
forms. Instances of forms of this class contain 3 
digit integer values. 

Claaa Data structure; All the data stored in a class 
manager regarding a particular class. The class 
descriptor is the mosc important part of this data 
structure, but there may be more information also. 

Class Definition; The specification of a form class. 

Class Pescrlptor: a memory object which stores the 
form-class definition. In the class manager, it is 
stored as a form. On disk, it is stored as an ASCII 
string. Basically, it is a particular representation 
or format for a class definition. It can be an ASCII 
file or a form type of representation. When the class 
manager does not have a class descriptor it needs, it 
asks the foreign application that created the class 
definition for the class descriptor. It then receives 
a class descriptor in the format of a form as generated 
by the foreign application. Alternatively , the class 
manager searches a file or files identified to it by 
the application requesting the semantic-dependent 
operation or identified in records maintained by the 
class manager. The class definiti -ns stored in these 
files are in ASCII text format. Tue class manager then 
converts the ASCII text so found to a class descriptor 
in the format of a native form by parsing the ASCII 
text into the various field names and specifications 
for the contents of each field. 
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Client Application ; a data consuming or data 
publishing process, i.e., a computer program which is 
running, other than an operating system program that is 
linked to the communication interface according to the 
5 teachings of the invention. 

Computer Network ; A data pathway between multiple 
computers by hardware connection such as a local or 
wide area network or between multiple processes running 
on the same computer through facilities provided by the 
10 operating system or other software programs and/ or 

shared memory including a Unix pipe between processes. 

configurat ion Decoupling; The property of a computer 
system/network implementing the teachings of the 
invention which is inherently provided by the 

15 distributed communication layer. This layer, by 

encapsulating the detailed protocols of how to set up 
and destroy communication links on a particular 
configuration for a computer network, frees client 
processes, whether data publishers or data consumers 

20 from the need to know these details. 

Configuration File ; A file that stores data that 
describes the properties and attributes or parameters 
of the various software components, records and forms 
in use. 

25 Constructed Field ; A field which contains another form 
or data record. 

Consumer ; a client or consumer application or end user 
which is requesting data. 



30 



Data Distr ibution Decoupling; The function of the 
communication interface software according to the 
teachings of the invention which frees client . 
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applications of the necessity to know and provide the 
network addresses for servers providing desired 
services. 

Decoupling: Freeing a process, software module or 
application from the need to know the communication 
protocols, data formats and locations of all other 
processes, computers and networks with which data is to 
be interchanged. 

Distributed Communication Laver: the portion of the 
apparatus and method according to the teachings of the 
invention which maps the access procedure identified by 
the service layer to a particular network or 
transparent layer protocol engine and sets up the 
required communication channel to the identified 
service using the selected network protocol engine* 

£iei£: one component in an instance of a form which may 
have one or more components each named differently and 
each meaning a different thing. Fields are "primitive » 
if they contain actual data and are "constructed" if 
they contain other forms, i.e., groupings of other 
fields. A data record or form which has at least one 
field which contains another form is said to be 
"nested". The second form recorded in the constructed 
field of a first form has its own fields which may also 
be primitive or constructed. Thus, infinitely complex 
layers of nesting may occur. 

ESEeigju A computer or software process which uses a 
different format of data record than the format data 
record of another computer or software process. 

f2£m: A data record or data object which is 
self-describing in its structure by virtue of inclusion 
of fields containing class descriptor numbers which 
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correspond to class descriptors, or class definitions. 
These class descriptors describe a class of form the 
instances of which all have the same internal 
representation, the same organization and the same 
5 semantic information. This means that all instances, 

i.e., occurrences, of forms of this class have the same 
number of fields of the same name and the data in 
corresponding fields have the same representation and 
each corresponding field means the same thing. Forms 

10 can be either primitive or constructed. A form is 

primitive if it stores only a single unit of data. A 
form is constructed if it has multiple internal 
components called fields. Each field is itself a form 
which may be either primitive or constructed. Each 

15 field may store data or the class_id, i.e., the class 
number, of another form. 

Format Operation ; An operation to convert a form from 
one format to another format. 

Format or Type ; The data representation and data 
20 organization of a structural data record, i.e., form. 

Handle ; A pointer to an object, record, file, class 
descriptor, form etc. This pointer essentially defines 
an access path to the object. Absolute, relative and 
offset addresses are examples of handles. 

25 Ifi: A unique identifier for a form, record, class, 
memory object etc. The class numbers assigned the 
classes in this patent specification are examples of 
ID'S. 

Information Laver; the portions of the apparatus and 
30 method according to the teachings of the invention 
which performs subject based addressing by mapping 
information requests on particular subjects to the 
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names of services that supply information on the 
requested subject and the service disciplines used to 
communicate with these services. 

Interface ; A library of software programs or modules 
5 which can be invoked by an application or another 

module of the interface which provide support functions 
for carrying out some task* In the case of the 
invention at hand, the communication interface provides 
a library of programs which implement the desired 
10 decoupling between foreign processes and computers to 
allow simplified programming of applications for 
exchanging data with foreign processes and computers. 

Interface Card ; The electronic circuit that makes a 
physical connection to the network at a node and is 
15 driven by transparent layer protocol programs in the 
operating system and network and data-link protocol 
programs on the interface card to send and receive data 
on the network. 

Native Format /Form ; The format of a form or the form 
20 structure native to an application and its host 
computer. 

Nested ; A data structure comprised of data records 
having multiple fields each of which may contain other 
data records themselves containing multiple fields. 

25 Network Protocol Engine : a software and hardware 
combination that provides a facility whereby 
communication may be performed over a network using a 
particular protocol. 

Node : Any computer, server or terminal coupled to the 
30 computer network. 

Primitive Field : A field of a form or data record which 
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stores actual data. 

£E2SSsa: An instance of a software program or module 
in execution on a computer. 

Semantic-Dependent operation - An operation requiring 
5 access to at least the semantic information of the 

class definition for a particular form to supply data 
from that form to some requesting process. 

Semaptjc Information ! with respect to formS/ the names 
and meanings of the various fields in a form. 

10 figsyej:: a computer running a data producer process to 
do something such as supply files stored in bulk 
storage or raw data from an information source such as 
Telerate to a requesting process even if the process is 
running on the same computer which is running the data 

15 producer process. 

Server progenst An application process that supplies 
the functions of data specified by a particular 
service, such as Telerate, Dow Jones Mews Service, etc. 



20 



service.: a meaningful set of functions or data usually 
in the form of a process running on a server which can 
be exported for use by client applications, in other 
words, a service is a general class of applications 
which do a particular thing, e.g., applications 
supplying Dow Jones News information. Quotron datafeed 
25 or a trade ticket router. An application will 

typically export only one service, although it can 
export many different services. 

Seryj,pe Piscjpllne or Service Pr-m^ py. A program or 
software module implementing a communication protocol 
30 for communication with a particular service and 



WO 92/07324 



PCT/US91/07652 



-147- 

including routines by which to select one of several 
servers that supplies a service in addition to 
protocols for communicating with the service and 
advising the communication layer which server was 
5 selected and requesting that a communication link be 
set up. 

Service Access Protocol! A subset of the associated 
service discipline that encapsulates a communication 
protocol for communicating with a service. 

Service Instance : A process running on a particular 
computer and which is capable of providing the 
specified service (also sometimes called a server 
process) . For a given service, several service 
instances may be concurrently providing the service so 
as to improve performance or to provide fault 
tolerance. The distributed communication component of 
the TIB™ communication software implements "fault- 
tolerant" communication by providing automatic 
switchover from a failed service instance to an 
operational one providing the same service. 

Service Lavert the portion of apparatus and method 
according to the teachings of the invention that maps 
data received from the information layer to the access 
procedure to be used to access the service or other 
25 source for the requested information to provide service 
decoupling. 

Service Decoupling; The function of the service layer 
of the communication interface software according to 
the teachings of the invention which frees client 
30 applications of the necessity to know and be able to 
implement the particular communication protocols 
necessary to access data from or otherwise communicate 
with services which supply data on a particular 
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subject. 

Service Record : A record containing fields describing 
the important characteristics of an application 
providing the specified service. 

5 Subject Domain : A set of subject categories (see also 
subject space) . 

Subject Space ; A hierarchical set of subject 
categories. 

Subscribe Request : a request for data regarding a 
10 particular subject which does not specify the source 

server or servers, process or processes or the location 
of same from which the data regarding this subject may 
be obtained. 

Transpo rt Laver : A layer of the standard ISO model for 
15 networks between computers to which the communication 
interface of the invention is linked. 

Transport Protocols The particular communication 
protocol or discipline implemented on a particular 
network or group of networks coupled by gateways or 
20 other internetwork routing. 



* 
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What is claimed is: 

1. An apparatus for facilitating 
communication of data between two or more software 
processes in execution on the same or different 
computers coupled by a data exchange medium where no 
process needs to know the port address of any other 
process, comprising: 

one or more computers; 

a netwc comprised of at least one data 
transfer pat., said network coupling said one 
or more computers by one or more data 
transfer paths; 

at least one application process in 
execution on at least one of said computers 
and capable of requesting data by subject; 

at least one data publishing process 
which may or may not be the same as said 
application process, said data publisning 
process in execution on at least one said 
computer and capable of outputting data on at 
least one subject; 

a subject based addressing program 
including means for obtaining data on 
different subjects, said subject based 
addressing program including computer 
programs linked at least to each of said at 
least one application and data publishing 
processes and to said network for receiving 
and processing subscription requests from 
said at least one application process on at 
least one subject and for mapping said 
subject to an appropriate means for obtaining 
data on the requested subject, and for 
entering a subscription for data on the 
requested subject with said appropriate means 
for obtaining data on said subject, and for 
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receiving data on the requested subject and 
passing the data to the appropriate said one 
or more application processes which requested 
the data. 



5 2. The apparatus of claim 1 further 

comprising a data format decoupling program comprised 
of one or more computer programs coupled to each of 
said one or more application processes and to each of 
said one or more data publishing processes, for 

10 facilitating the transfer of data via said network 
between said data publishing process and said 
application process using self -describing data objects 
or forms by performing format conversion operations 
where the formats for the expression and organization 

15 of data records used by each computer, data publishing 
process or application process may be different, and 
where said self -describing data objects each contain 
one or more fields and are organized into one or more 
classes each of which has a unique class 

20 identification, said data format decoupling program 

including one or more computer programs to define the 
general organization of each class of self-describing 
data objects in terms of the semantic information or 
names of each field and the format information defining 

25 the class identification or code used to express the 

data contained in each field in a class definition, and 
wherein the actual data to be transferred and said 
format information is stored in each instance of a 
self -describing data object, and wherein said data 

30 format decoupling means includes at least one forms 

manager program means for converting the data format of 
data on a subject requested by an application process 
from the data format in which said data is published by 
said data publishing process to a format suitable for 

35 transfer via said network and, upon receipt from said 

network, for converting said data from the format used 
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transfer over said network to a format used by said 
application process, and for performing one or more of 
said format conversion operations using format 
information stored in the instance of the form itself 
or in said class definition. 

3. The apparatus of claim 1 further 
comprising a data format decoupling library comprised 
of one or more computer programs linked at least to 
each said one or more application processes and to said 
one or more data publishing processes, including at 
least one class manager means for performing semantic- 
dependent operations to facilitate the exchange of 
self -describing data objects called forms between said 
at least one application process which uses data in a 
first format and said at least one data publishing 
process which outputs data in a second format which may 
be different from said first format, said forms each 
being comprised of one or more fields each of which 
contains another form which may be either a primitive 
class form in that said field contains data or a 
constructed class form, said constructed class form 
containing one or more fields each which may be a 
primitive class form or another constructed class form, 
such that form classes may be nested to any number of 
nesting levels, said class manager means further for 
facilitating the exchange of data by receiving a 
request to get data from a particular named field of a 
particular instance of a form, and for searching a 
class definition for a field having a name matching the 
field named in said request, said searching including 
searching of all class definitions for any f iele in 
the class definition containing constructed class 
forms, and including searching through all levels of 
nesting of said class definitions, and for returning a 
relative address pointer to the requesting process 
identifying the location of the requested field within 
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instances of the form of the class of forms containing 
the requested field, and further for receiving a 
request for the particular data contained in said field 
named in the original request and using said relative 
5 address pointer and the address of a particular 

instance of said form to read the requested data and 
return said data to said requesting process. 

4. The apparatus of claim 2 further 
comprising a data format decoupling library comprised 

10 of one or more computer programs linked at least to 
each said one or more application processes and said 
one or more data publishing processes, including at 
least one class manager means for performing semantic- 
dependent operations to facilitate the exchange of 

15 self -describing data objects called forms between said 

at least one application process which uses data in a 
first format and said at least one data publishing 
process which may publish data in a second format 
different from said first format, said forms each being 

20 comprised of one or more fields each of which contains 
another form which may be either a primitive class form 
in that said field contains data or a constructed class 
form, said constructed class form containing one or 
more fields each of which may be a primitive class form 

25 or another constructed class form, such that form 

classes may be nested to any number of nesting levels, 
said class manager means also for facilitating the 
exchange of data by receiving a request to get the data 
from a particular field of a particular instance of a 

30 form and for searching a class definition for a field 
having a name matching the field named in said request, 
said searching including searching of all class 
definitions for any fields of said class definition 
containing constructed class forms and including 

35 searching through all levels of nesting of said class 

definitions, and for returning to the requesting 
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process a relative address pointer identifying the 
location of the requested field within instances of the 
form of the class of forms containing the requested 
field, and further for receiving a request for the 
particular data contained in said field named in the 
original request and for using said relative address 
pointer and the address of the particular instance of 
the form to read the requested data and return said 
data to said requesting process. 

5. The apparatus of claim 1 wherein said 
subject based addressing program further comprises 
means for issuing a command to establish a subscription 
communication session on said subject with one or more 
of said data publishing processes capable of supplying 
data on the requested subject , and wherein said means 
for obtaining data on different subjects includes a 
service discipline program for encapsulating a protocol 
for obtaining data on the subject, and for receiving 
said command to establish a subscription communication 
session on said subject, and for establishing a 
subscription communication session with one or more of 
said data publishing processes, and entering a 
subscription request to said one or more data 
publishing processes to supply data on said subject, 
and for receiving data on said subject and passing said 
data to said one or more application processes which 
requested data on said subject. 

6. The apparatus of claim 1 wherein said 
subject based addressing program further comprises 
means for issuing a command to establish a subscription 
communication session with one or more of said data 
publishing processes capable of supplying data on the 
requested subject, and wherein said network further 
comprises transport layer protocol means for 
transferring data through said data transfer path 
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according to a particular protocol native to said 
network , and further comprising a service discipline 
program means for encapsulating a communication 
protocol program capable of being invoked by said 
5 subject based addressing program via said command to 

. establish a subscription communication session on the 
requested subject, and also for establishing a 
subscription communication session with one or more of 
said data publishing processes by invoking said 

10 transport layer protocol means and sending thereto data 

to be transmitted over said network on said subject, 
said service discipline program means also for sending 
an appropriate message to said one or more data 
publishing processes using the appropriate protocol for 

15 communicating with said data publishing processes to 
establish said subscription communication session for 
data on the requested subject, and also for receiving 
data on the requested subject and passing said data to 
said at least one application process which requested 

20 said data. 

7. The apparatus of claim l or 2 or 3 or 4 
wherein said subject based addressing program further 
comprises means for issuing a command to establish a 
subscription communication session with one or more of 
said data publishing processes capable of supplying 
data on the requested subject, and further comprising a 
service discipline means for encapsulating a protocol 
for communicating with one or more of said data 
publishing processes identified by said subject based 
addressing program as capable of supplying data on said 
subject, and for receiving said command to establish a 
subscription communication session with at least one or 
more of said data publishing processes, and for 
establishing a subscription communication session with 
said one or more data publishing processes, and for 
entering a subscription request to said data publishing 
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process, and for receiving data on said subject and 
passing said data to said one or more application 
processes which requested data on said subject. 

8. The apparatus of claim 1 further 
comprising at least two said networks, and means 
coupled to said at least one application process and 
said at least one data publishing process for providing 
network failure fault tolerance by automatically 
switching to an alternate network upon failure of the 
data transfer path being used. 

9. The apparatus of claim 1 or 2 or 3 or 8 
further comprising at least two server computers 
coupled to said network each of which has at least one 
said fata publishing process in execution thereon 
capable of supplying data on the subject upon which 
data has been requested by said one or more application 
processes, and further comprising means coupled at 
least to said server computers and to said at least one 
application process for providing server failure fault 
tolerance by automatically switching to another server 
computer upon which a data publishing process supplying 
data on the requested subject is in execution so as to 
maintain a substantially uninterrupted flow of data on 
the requested subject to said at least one application 
process which requested data on said subject. 

10. The apparatus of claim 1 or 2 or 3 or 7 
further comprising at least two said networks coupling 
at least some of said one or more computers, and 
wherein said one or more computers includes at least 
two server computers upon which at least two data 
publishing processes or service instances are in 
execution publishing data on said subject upon which 
data has been requested and further comprising means 
coupled to said at least one application process and to 
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said at least two service instances and to said at 
least two networks for providing network failure and 
service instance failure fault tolerance by 
automatically switching to an alternate network upon 
5 failure of the data transfer path being used to 

transfer data to said application process on said 
subject and by automatically switching to another 
service instance supplying data on the requested 
subject in case of service instance failure so as to 
10 provide a substantially continuous flow of data on the 
requested subject to said at least one application 
process which is requesting data on said subject. 

11. The apparatus of claim 10 further 
comprising means coupled to said server computers and 

15 to said at least one application process for providing 

server failure fault tolerance by automatically 
switching to another server computer upon which a data 
publishing process or service instance supplying data 
on* the requested subject is in execution so as to 

20 maintain a substantially uninterrupted flow of data on 
the requested subject to said at least application 
process which requested data on said subject. 

12. The apparatus of claim 5 wherein said 
one or more computers includes at least two server 

25 computers each of which has running thereon a data 

publishing process or service instance, at least two of 
said server computers and the service instances in 
execution thereon requiring different communication 
protocols to communicate therewith, and wherein said 

30 service discipline program includes means for 

encapsulating at least two different service discipline 
protocols, each for communicating with at least one of 
said different data publishing processes or service 
instances in execution on said server computers, and 

35 wherein said subject based addressing program includes 
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means for mapping said subject -o the appropriate said 
service discipline protocol, -*id for invoking said 
service discipline protocol so as to establish a 
communication session on the requested subject and, for 
passing data received on said subject to the 
appropriate one or more application processes which 
requested data on said subject, and further comprising 
at least two said networks, and means coupled to said 
at least two networks and to said at least one 
application process and said at least one data 
publishing process or service instance for providing 
network failure fault tolerance by automatically 
switching to an alternate network upon failure of the 
data transfer path being used to transfer data on said 
subject. 

13. The apparatus of claim 5 wherein said 
one or more computers comprises at least two server 
computers, and wherein said at least one data 
publishing process comprises at least two service 
instances in execution on at least two of said server 
computers, and wherein at least two of said service 
instances and /or server computers require different 
communication protocols to communicate therewith , and 
wherein said service discipline program encapsulates at 
least two different service discipline protocols for 
communicating with said at least two different service 
instances, and wherein said subject based addressing 
program includes means for mapping said subject to one 
or more of said service discipline protocols, and 
further includes means for issuing a subscription 
request so as to cause the appropriate service 
discipline protocol to execute and set up a 
communication session so as to obtain data on said 
subject, each said service discipline protocol 
including means for passing data received on said 
subject to the one or more application processes which 
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reguested data on said subject. 

14. The apparatus of claim 12 or 13 wherein 
said service discipline program includes means for 
monitoring the continued viability of one or more of 

5 said server computers as a source of data on the 

requested subject, and, upon failure of the monitored 
server computer supplying data on the requested 
subject, for automatically selecting another server 
computer upon which a service instance is in execution 

10 which is capable of supplying data on said subject, and 
for establishing a communication session therewith on 
said subject by invoking an appropriate service 
discipline protocol, and for passing the resulting data 
on the requested subject to the one or more application 

15 processes which requested said data. 

15. The apparatus of claim 12 wherein said 
service discipline program includes means for 
monitoring the continued viability of at least one said 
service instance as a source of data on the requested 

20 subject with which a communication session has been 

established, and, upon failure of the service instance 
to supply data on the requested subject, for 
automatically selecting another server computer upon 
which another service instance is in execution which is 

25 capable of supplying data on the requested subject, and 
for establishing a communication session therewith by 
invoking an appropriate service discipline protocol and 
establishing a subscription with said service instance 
to supply data on the requested subject, and for 

30 passing the resulting data on the requested subject to 
the one or more application processes which requested 
said data. 

16. The apparatus of claim 1 further 
comprising communication means coupled at least to said 



-159- 



subject based addressing program including at least one 
protocol engine for encapsulating a network 
communication protocol for establishing communications 
regarding said subject using the protocol native to 
said network. 

17. The apparatus of claim 16 wherein said 
communication means includes a plurality of different 
protocol engines, each of which may encapsulate a 
different network communication protocol. 

18. The apparatus of claim 16 wherein at 
least one said protocol engine supports subject based 
addressing by filtering incoming data by subject. 

19. The apparatus of claim 5 wherein said 
service discipline program is coupled to a data 
publishing process supplying data on said subject, and 
supports subject based addressing by filtering data by 
subject so as to conserve network bandwidth. 

20. The apparatus of claim 16 wherein said 
communication means is coupled to each of said 
application and data publishing processes, and wherein 
said application and data publishing processes 
communicate over said network through respective 
protocol engines using the same network communication 
protocol, and wherein said protocol engines cooperate 
to implement a reliable broadcast protocol wherein data 
on a subject is transmitted in discrete messages and 
wherein the complete reception of all discrete messages 
of a broadcast data transmission on a subject for which 
there is an outstanding subscription is verified by the 
protocol engines coupled to the application and data 
publishing processes, and any lost or garbled messages 
are rebroadcast. 
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21. The apparatus of claim 16 wherein said 
communication means is coupled to each of said 
application and data publishing processes, and wherein 
said application and data publishing processes 
communicate over said network through respective 
protocol engines using the same communication protocol, 
and wherein said protocol engines include means for 
cooperating to implement an intelligent multicast 
protocol wherein data regarding a particular subject is 
transmitted over the network using point-to-point 
communication protocol between the data publishing 
process and each application process having a current 
subscription to data on said subject until the number 
of subscribing application processes exceeds a number 
of subscriptions wherein point-to-point communications 
are the most efficient way to send the data, and then 
for switching automatically to a broadcast 
communication protocol, 

22. The apparatus of claim 20 or 21 wherein 
said protocol engines filter data being transmitted 
over said network by subject. 

23. The apparatus of claim 21 wherein said 
protocol engines switch automatically to a reliable 
broadcast communication protocol when a point-to-point 
communication protocol is no longer the most efficient 
way to transmit said data. 

24. The apparatus of claim 21 wherein said 
protocol engines cooperate to implement an intelligent 
multicast protocol wherein any number of switches 
between said point-to-point and broadcast protocols may 
occur depending upon the number of application 
processes subscribing to said subject at any particular 
time, and wherein said point-to-point protocol includes 
reliable point-to-point protocol and said broadcast 
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protocol includes reliable broadcast protocol. 

25. An apparatus for facilitating 
communication of data between two or more software 
processes in execution on the same or different 
computers coupled by a network or data transfer path 
where no process needs to know the network address of 
any other process, comprising: 

one or more computers; 
a network coupling said one or more 
computers by one or more data transfer paths; 

at least one data publishing process in 
execution on at least one said computer so as 
to implement at least one service instance 
and capable of outputting data for 
transmission over said network; 

at least one application process in 
execution on at least one of said computers 
and capable of requesting data from one or 
more of said service instances; 

a service discipline protocol for 
receiving from at least one application 
process a subscription request to obtain 
data, and for encapsulating a communication 
protocol so as to be able to communicate over 
said network so as to obtain said data, and 
for establishing a communication session to 
obtain said data, and, thereafter, for 
receiving said requested data in real time 
including any changes therein, and providing 
said data to said application process. 

26. The apparatus of claim 25 wherein said 
at least one data publishing process comprises a 
plurality of data publishing processes, and wherein 
said service discipline protocol encapsulates at least 
one communication protocol to communic >e with at least 
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each different data publishing process , and wherein at 
least one of said communication protocols includes 
means to receive subscription requests to establish a 
communication session with a particular service 
5 instance, and further comprising means for verifying 
that the application process that has issued a 
subscription request for data is entitled to receive 
said data before invoking the appropriate service 
discipline protocol to obtain said data. 

10 27. The apparatus of claim 25 wherein said 

at least one data publishing process comprises a 
plurality of data publishing processes each of which 
may require a different communication protocol to 
obtain data therefrom, and wherein said service 

15 discipline protocol encapsulates a communication 

protocol to communicate with at least each different 
data publishing process, and wherein said service 
discipline protocol includes means for automatic 
recovery from various types of failures. 

20 28. The apparatus of claim 27 wherein said 

service discipline protocol includes means for 
automatic recovery from failure of a service instance 
by automatic switchover to another service instance 
supplying the same type data. 

25 29. The apparatus of claim 27 wherein said 

at least one data publishing process comprises a 
plurality of data publishing processes, and wherein 
said service discipline protocol encapsulates a 
communication protocol to communicate with at least 

30 each different data publishing process, and wherein 
said service discipline protocol includes means for 
automatic recovery from failures of various types 
including at least failure of a computer upon which a 
service instance is in execution by automatic 
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switchover to another computer upon which is executing 
another service instance supplying the same type data. 

30. The apparatus of claim 25 wherein said 
at least one data publishing process comprises a 

5 plurality of service instances, and wherein said 

service discipline protocol encapsulates a plurality of 
communication protocols, including at least one 
communication protocol to communicate with at least 
each different service instance , and wherein said 
10 service discipline protocol includes means for 

automatic recovery from failure of a computer upon 
which a service instance is in execution or upon 
failure of the service instance itself by automatic 
switchover to another computer having in execution 
15 thereon another service instance supplying the same 
type data. 

31. The apparatus of claim 25 or 26 or 27 or 
28 or 29 or 30 further comprising a communication means 
encapsulating at least one protocol engine coupled to 
said service discipline protocol for interfacing said 
communication protocols of said service discipline 
protocol to the communication protocol of said network, 
such that said application process is decoupled from 
the complexities of communication using whatever 
communication protocol is being used on said network. 

32. The apparatus of claim 25 wherein said 
communication means encapsulates a plurality of 
protocol engines, each of which may encapsulate a 
different network communication protocol. 

30 33. The apparatus of claim 32 wherein said 

service discipline protocol includes means to verify 
that an application process which has issued a 
subscription request for data is entitled to receive 
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such data before the appropriate communication protocol 
of said service discipline protocol is invoked and 
before the appropriate protocol engine is invoked to 
set up a communication session to obtain said data. 

34. The apparatus of claim 25 further 
comprising a communication means coupled to said 
application and data publishing processes and 
encapsulating a plurality of protocol engines coupled 
to said service discipline protocol for interfacing 
said communication protocols of said service discipline 
protocol to the communication protocol of said network, 
and to carry out various reliable communication 
protocols over said network such that said application 
process is decoupled from the complexities of 
communication on said network and need not be concerned 
with the need to verify the accuracy or completeness of 
data received over said network. 

35* The apparatus of claim 31 further 
comprising a communication means coupled to each data 
publishing process for encapsulating at least one 
protocol engine for interfacing said data publishing 
process to the network communication protocol, and for 
encapsulating protocol engines coupled to said 
application processes, said protocol engines 
cooperating to implement a reliable broadcast 
communication protocol wherein the protocol engines 
coupled to a plurality of said application processes 
which are all receiving the same data from the same 
data publishing process via a broadcast communication 
protocol communicate with the protocol engine coupled 
to said data publishing process to insure that all 
messages are successfully received by all subscribing 
application processes and coordinate to cause 
retransmission of missing or garbled messages* 
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36. The apparatus of claim 31 wherein said 
communication means encapsulates a plurality of 
protocol engines for interfacing said service 
discipline protocols to said network communication 
5 protocol, at least some of said protocol engines being 
coupled to said application and data publishing 
processes and including means for cooperating to 
implement a reliable broadcast communication protocol 
wherein the protocol engines coupled to a plurality of 

10 said application processes which are all receiving data 

from the same data publishing process via a broadcast 
communication protocol communicate with the protocol 
engine coupled to the data publishing process to insure 
that all packets of a message are successfully received 

15 by all subscribing application processes including 
cooperating to cause retransmission of missing or 
garbled packets, and at least some of said protocol 
engines cooperating to implement an intelligent 
multicast protocol wherein the protocol engines coupled 

20 to said data publishing process sends data to 

subscribing application processes via a point-to-point 
communication protocol until the number of subscribing 
processes reaches a number where it would be more / 
efficient to send the data via a broadcast protocol, 

25 and then automatically switching to said reliable 
broadcast protocol when said number is reached and 
switching back and forth as needed between said point- 
to-point or reliable broadcast protocols depending upon 
the number of subscribing application processes at any 
30 particular time. 

37. The apparatus of claim 30 further 
comprising a communication means coupled to each data 
publishing process for encapsulating at least one 
protocol engine for interfacing the communication 
35 protocol of said data publishing process to the network 
communication protocol, at least some of said protocol 
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engines coupled to said application and data publishing 
processes cooperating to implement a reliable broadcast 
communication protocol wherein the protocol engines 
coupled to a plurality of said application processes 
5 which are all receiving data from the same data 
publishing process via a broadcast communication 
protocol communicate with the protocol engine coupled 
to the data publishing process to insure that all 
packets of a message are successfully received by all 

10 subscribing application processes including 

retransmission of missing or garbled packets, and 
wherein at least some of said protocol engines coupled 
to said application and data publishing processes 
cooperate to implement an intelligent multicast 

15 protocol wherein a protocol engine coupled to said data 
publishing process sends data to subscribing 
application processes via a point-to-point 
communication protocol until the number of subscribing 
processes reaches a number where it would be more 

20 efficient to send the data via a broadcast protocol and 
automatically switching to said reliable broadcast 
protocol when said number is reached and switching back 
and forth freely between point-to-point and reliable 
broadcast communication protocols depending upon the 

25 number of subscribing application processes at any 
particular time. 

38. The apparatus of claim 35 wherein said 
at least one data publishing process comprises a 
plurality of data publishing processes each running on 

30 a server, and wherein said service discipline protocol 
encapsulates a plurality of communication protocols, 
including at least one communication protocol to 
communicate with at least each different data 
publishing process, and wherein said service discipline 

35 protocol includes means for automatic recovery from 
failure of a server upon which a data publishing 
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process is in execution by automatic switchover to 
another server having in execution thereon another data 
publishing process supplying the same type data, 

39. The apparatus of claim 38 wheredn at 
least some of said protocol engines coupled to said 
application and data publishing processes cooperate to 
implement an intelligent multicast protocol wherein the 
protocol engines coupled to said data publishing 
process sends data to subscribing application processes 
via a point-to-point communication protocol until the 
number of subscribing processes reaches a number where 
it would be more efficient to send the data via a 
broadcast protocol, and automatically switching to said 
reliable broadcast protocol when said number is 
reached, and freely switching back and f 4orth depending 
upon the number of subscribing application processes at 
any particular time. 

40. The apparatus of claim 39 wherein said 
network comprises at least two networks, and wherein 
said service discipline protocol includes means for 
automatic recovery from failure of a data publishing 
process or upon failure of a network by automatic 
switchover to another server having in execution 
thereon another data publishing process supplying the 
same type data as the failed data publishing process 
supplied or automatic switchover to another network so 
as to be able to continue to obtain the requested data. 

41. The apparatus of claim 25 or 26 or 28 or 
29 or 32 or 33 or 34 or 35 wherein said service 
discipline protocol includes means to communicate over 
said network the fact that an application process has 
requested data on a particular subject, and further 
comprising a service discipline protocol coupled to a 
data publishing process, including means for 
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maintaining a list of active subscriptions, and for 
filtering outgoing data by subject to conserve network 
bandwidth. 

42. The apparatus of claim 41 wherein said 
5 service discipline means coupled to said data 

publishing process further comprises means for using 
said subscription list to determine the appropriate 
communication protocol to use in communicating said 
data over said network. 



10 43. The apparatus of claim 25 further 

comprising a subject based addressing program including 
means for obtaining data on different subjects, said 
subject based addressing program including computer 
programs linked at least to each of said at least one 

15 application and data publishing processes and to said 
network for receiving and processing subscription 
requests from said at least one application process on 
at least one subject and for mapping said subject to an 
appropriate means for obtaining data on the requested 

20 subject, and for entering a subscription for data on 

the requested subject with said appropriate means for 
obtaining data on said subject, and for receiving data 
on the requested subject and passing the data to the 
appropriate said one or more application processes 

25 which requested the data. 

44. The apparatus of claim 25 further 
comprising a data format decoupling library comprised 
of one or more computer programs linked at least to 
each said one or more application processes and said 
30 one or more data publishing processes, including at 

least one class manager means for performing semantic- 
dependent operations to facilitate the exchange of 
self -describing data objects called forms between said * 
at least one application process which uses data in a 
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first format and said at least one data publishing 
process which may publish data in a second format 
different from said first format, said forms each being 
comprised of one or more fields each of which contains 
5 another form which may be either a primitive class form 
in that said field contains data or a constructed class 
form, said constructed class form containing one or 
more fields each of which may be a primitive class form 
or another constructed class form, such that form 
10 classes may be nested to any number of nesting levels, 

said class manager means also for facilitating the 
exchange of data by receiving a request to get the data 
from a particular field of a particular instance of a 
form and for searching a class definition for a field 
15 having a name matching the field named in said request, 

said searching including searching of all class 
definitions for any fields of said class definition 
containing constructed class forms and including 
searching through all levels of nesting of said class 
20 definitions, and for returning to the requesting 

process a relative address pointer identifying the 
location of the requested field within instances of the 
form of the class of forms containing the requested 
field, and further for receiving a request for the 
25 particular data contained in said field named in the 
original request and for using said relative address 
pointer and the address of the particular instance of 
the form to read the requested data and return said 
data to said requesting process. 

30 45. The apparatus of claim 25 further 

comprising at least two said networks, and means 
coupled to said at least one application process and 
said at least one data publishing process for providing 
network failure fault tolerance by automatically 

35 switching to an alternate network upon failure of the 
data transfer path being used. 
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46. The apparatus of claim 25 or 43 or 44 or 
45 further comprising at least two server computers 
coupled to said network each of which has at least one 
said data publishing process in execution thereon 
capable of supplying data on the subject upon which 
data has been requested by said one or more application 
processes, and further comprising means coupled at 
least to said server computers and to said at least one 
application process for providing server failure fault 
tolerance by automatically switching to another server 
computer upon which a data publishing process supplying 
data on the requested subject is in execution so as to 
maintain a substantially uninterrupted flow of data on 
the requested subject to said at least one application 
process which requested data on said subject. 

47. The apparatus of claim 43 wherein said 
one or more computers includes at least two server 
computers each of which has running thereon a data 
publishing process or service instance, at least two of 
said server computers and the service instances in 
execution thereon requiring different communication 
protocols to communicate therewith, and wherein said 
service discipline program includes means for 
encapsulating at least two different service discipline 
protocols, each for communicating with at least one of 
said different data publishing processes or service 
instances in execution on said server computers, and 
wherein said subject based addressing program includes 
means for mapping said subject to the appropriate said 
service discipline protocol, and for invoking said 
service discipline protocol so as to establish a 
communication session on the requested subject and, for 
passing data received on said subject to the 
appropriate one or more application processes which 
requested data on said subject. 
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48. The apparatus of claim 25 wherein said 
service discipline program includes means for 
monitoring the continued viability of one or more of 
said server computers as a source of data on the 
requested subject, and, upon failure of the monitored 
server computer supplying data on the requested 
subject, for automatically selecting another server 
computer upon which a service instance is in execution 
which is capable of supplying data on said subject, and 
for establishing a communication session therewith on 
said subject by invoking an appropriate service 
discipline protocol, and for passing the resulting data 
on the requested subject to the one or more application 
processes which requested said data. 

4. * The apparatus of claim 43 further 
comprising communication means coupled at least to said 
subject based addressing program including at least one 
protocol engine for encapsulating a network 
communication protocol for establishing communications 
regarding said subject using the protocol native to 
said network. 

50. The apparatus of claim 25 wherein said 
service discipline program is coupled to a data 
publishing process supplying data on said subject, and 
supports subject based addressing by filtering data by 
subject so as to conserve network bandwidth. 

51. The apparatus of claim 49 wherein said 
communication means is coupled to each of said 
application and data publishing processes, and wherein 
said application and data publishing processes 
communicate over said network through respective 
protocol engines using the same network communication 
protocol, and wherein said protocol engines cooperate 
to implement a reliable broadcast protocol wherein data 
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on a subject is transmitted in discrete messages and 
wherein the complete reception of all discrete messages 
of a broadcast data transmission on a subject for which 
there is an outstanding subscription is verified by the 
protocol engines coupled to the application and data 
publishing processes, and any lost or garbled messages 
are rebroadcast. 

52. The apparatus of claim 49 wherein said 
communication means is coupled to each of said 
application and data publishing processes, and wherein 
said application and data publishing processes 
communicate over said network through respective 
protocol engines using the same communication protocol, 
and wherein said protocol engines include means for 
cooperating to implement an intelligent multicast 
protocol wherein data regarding a particular subject is 
transmitted over the network using point-to-point 
communication protocol between the data publishing 
process and each application process having a current 
subscription to data on said subject until the number 
of subscribing application processes exceeds a number 
of subscriptions wherein point-to-point communications 
are the most efficient way to send the data, and then 
for switching automatically to a broadcast 
communication protocol. 

53. The apparatus of claim 49 or 51 or 52 
wherein said protocol engines filter data being 
transmitted over said network by subject. 

54. The apparatus of claim 25 or 26 or 27 or 
28 or 29 or 30 or 32 or 33 or 34 or 37 further 
comprising data format decoupling means comprised of 
one or more computer programs coupled to each of said 
one or more application processes and said one or more 
data publishing processes, for facilitating the 
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transfer of data via said network between said data 
publishing process and said application process using 
self -describing data objects by performing format 
conversion operations where the formats for the 
5 expression and organization of data records used by 

each computer, data publishing process and application 
process may be different, and wherein said self- 
describing data objects each contain one or more fields 
and are organized into one or more classes each of 
10 which has a unique class identification, and wherein 
the general organization of each class of self- 
describing data objects in terms of the names of each 
field and the format information defining either the 
class identification of the self -describing data object 
15 referenced in a field of another self -describing data 
object or the computer code used to express data 
contained in each field of the self-describing data 
object is defined in a class definition, and wherein 
the actual data to be transferred and said format 
20 information is stored in each instance of a self- 

describing data object, and wherein said data format 
decoupling means includes at least one forms manager 
program means for converting the data format of data on 
a subject requested by an application process from the 
25 data format in which said data is published by said 
data publishing process to a format suitable for 
transfer via said network and, upon receipt from said 
network, for converting said data from the format used 
for transfer over said network to a format used by said 
30 application process which requested the data, and for 
performing one or more of said format conversion 
operations using format information stored in the 
instance of the form itself. 
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MAY SEND WHATEVER MESSAGES 
ARE REQUIRED TO ACCESS 
THE SERVICE(S) AND ENTER OR 
REGISTER A SUBSCRIPTION 
TO START A STREAM OF 
DATA FLOWING AND NOTIFY TIB 
SOFTWARE AT SERVICE(S) OF 
TIB CHANNEL NUMBER 
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I 



SERVICE DISCIPLINE(S) OR 
PROTOCOL ENGINE(S) OPTIONALLY 
MONITORS SERVER(S) PERFORMANCE 
AND SELECTS A NEW SERVER(S) 
UPON FAILURE AND INVOKES 
THE APPROPRIATE PROTOCOL 
ENGINE(S) FOR THE NEW SERVER(S) 
TO SET UP A NEW DATA LINK 
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U 



J. 



ACTIVE PROTOCOL ENGINE(S) 
PERFORM VALUE ADDED SERVICE ON 
TOP pF TRANSPORT LAYER SUCH 
AS RELIABLE BROADCAST 
AND INTELLIGENT MULTICAST 
DURING COMMUNICATION 
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END 



FIGURE 19 B 
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SELECTED SERVICE(S) RECEIVES 
NOTIFICATION OF SUBS. AND 
INITIATES A OATA STREAM 
OIRECTED TO THE PORT ADORESS 
OF THE MACHINE ON WHICH THE 
SUBSCRIBING PROCESS IS 
RUNNING BY CALLING THE TIBINFO 
INTERFACE OF THE SERVICE'S 
INFORMATION LAYER AND 
INVOKING THE TIBINFO-PUBLISH 
FUNCTION TIB-PUBLISH-CREATE* 
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X 



SERVICE(S) STARTS SENDING 
OATA ON MANY DIFFERENT 
SUBJECTS INCLUDING SELECTED 
SUBJECT TO INFORMATION LAYER 
(TIB-INFO) AND INFORMATION 
LAYER CALLS SERVICE LAYER AND 
PASSES POINTER TO MESSAGE 



432 

V 



SERVICE DISCIPLINE^) DOES ANY 
NECESSARY FILTERING. ASSIGNS 
TIB CHANNEL NUMBERS AND 
PASSES POINTERS TO ALL 
MESSAGES TO DCC LIBRARY OF 
COMMUNICATION LAYER AND 
SENDS ANY NECESSARY MESSAGE 
TO SUBSCRIBER PROCESSES 



z 
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DCC LIBRARY FUNCTION FIGURES 
OUT A WAY TO SEND THE 
MESSAGE POINTED TO 8Y THE 
POINTER FROM THE SERVICE 
DISCIPLINE. SELECTS A PROTOCOL 
ENGINE ANO PUTS THE MESSAGE 
IN AN INTERPROCESS TRANSFER 
TO PROTOCOL ENGINE 
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J 



PROTOCOL ENGINE DOES 
APPROPRIATE MESSAGE 
PROCESSING, WRITES PORT 
ADDRESS OF MACHINE ON WHICH 
SUBSCRIBING PROCESS IS 
RUNNING AND ASSIGNED TIB 
CHANNEL NUMBER INTO MESSAGE 
OR PACKET HEADERS OF 
MESSAGES BEING TRANSMITTED 
AND PERFORMS OTHER VALUE 
ADDED SERVICES 
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\ 

PROTOCOL ENGINE TAKES 
APPROPRIATE STEPS TO 
TRANSMIT MESSAGES 
RECEIVED FROM THE SERVICE 
OISCIPUNE TO PORT 
AODRESS OF REQUESTING 
PROCESS ACCOROING TO 
RELIABLE TRANSMISSION 
METHOD SELECTED BY 
SERVICE LAYER OF 
REQUESTING PROCESS OR BY 
DCC LIBRARY 
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TRANSPORT, NETWORK, DATA 
LINK AND PHYSICAL LAYERS 
OF NETWORK ARE INVOKED 
BY PROTOCOL ENGINE TO 
SEND MESSAGES TO PORT 
ADORESS OF REQUESTING 
PROCESS 
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NETWORK INTERFACE CARD 
PICKS UP EACH MESSAGE OFF 
THE NETWORK ANO NOTIFIES 
OPERATING SYSTEM TRANS- 
PORT LAYER PROTOCOL 
SOFTWARE THAT A MESSAGE 
HAS ARRIVED 
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CONT'D ON FIGURE 20 B 



FIGURE 20 A 
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FROM FIGURE 20 A 



OPERATING SYSTEM TRANSPORT ' 
LAYER SOFTWARE CALLS 
PROTOCOL ENGINE OF DAEMON ON 
DISTRIBUTED COMMUNICATION 
LAYER AND PASSES MESSAGE 
TO IT 
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DAEMON REASSEMBLES PACKETS IF 
NECESSARY, DOES OTHER VALUE 
ADDED SERVICES IF APPLICABLE. 
FILTERS IF APPLICABLE AND 
CHECKS THE TIB CHANNEL DATA IN ^ 
THE MESSAGE HEAOER TO DETER- J 
MINE IF ANY PROCESS RUNNING ON 
THE MACHINE HAVING THAT PORT 
ADDRESS HAS SUBSCRIBED TO 
DATA ON THAT TIB CHANNEL 



z 



DAEMON PUTS MESSAGE INTO 
APPROPRIATE INTERPROCESS DATA 
TRANSFER MECHANISM AND CALLS MM 
CALLBACK ROUTINE IN DCC LIBRARY 45 °i 
OF THE SERVICE LAYER J 
ENCAPSULATING THE SERVICE 
DISCIPLINE THAT CALLED THE 
PROTOCOL ENGINE TO START THE 
SUBSCRIPTION TO NOTIFY IT OF A 
MESSAGE AND WHERE TO FIND 
SAME. DCC LIBRARY CALLBACK 
ROUTINE WRITES MESSAGE INTO 
LOCAL MEMORY OF SUBSCRIBING 
PROCESS AND CALLS CALLBACK 
ROUTINE OF SERVICE DISCIPLINE 
TO PASS IT A POINTER TO MES SAGE 



SERVICE DISCIPLINE CALLBACK 
ROUTINE RUNS, RETRIEVES 
MESSAGE FROM LOCAL MEMORY 
USING POINTER FROM DCC LIBRARY 
CALLBACK ROUTINE, DOES ANY 
NECESSARY DATA FORMAT 
CONVERSION AND CALLS CALLBACK 
ROUTINE OF THE TIB-INFO 
INTERFACE LIBRARY ON THE 
INFORMATION LAYER 



T 
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U 



TIB-INFO CALLBACK ROUTINE 
EXAMINES MESSAGE HEADER 
USING POINTER SUPPUEO BY 
THE SERVICE DISCIPLINE 
CALLBACK ROUTINE AND 
COMPARES THE TIB CHANNEL 
COOE TO THE SUBSCRIPTION 
TO VERIFY THAT THE MESSAGE 
IS ON THE CORRECT SUBJECT 
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IF THE MESSAGE IS ON THE 
SUBSCRIBED SUBJECT, TIB- 
INFO CALLS THE CALLBACK 
ROUTINE OF THE 
SUBSCRIBING PROCESS AND 
PASSES IT A POINTER TO THE 
MESSAGE IN LOCAL MEMORY 



z 
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V 



SUBSCRIBING PROCESS 
RETRIEVES THE MESSAGE 
AND USES IT FOR 
WHATEVER PURPOSE IT IS 
NEEDED 
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FIGURE 20B 
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500 

DCC LIBRARY RECEIVES REQUEST 
FROM SERVICE DISCIPLINE TO SEND A 
MESSAGE HAVING A TIB CHANNEL 
ASSIGNED BY SERVICE DISCIPLINE 



502 




DCC LIBRARY CHECKS SUBSCRIPTION 
LIST TO DETERMINE HOW MANY 
SUBSCRIBERS THERE ARE TO THIS TIB 
CHANNEL 



S.D. 



IS 

IT MORE 
EFFICIENT TO 
SEND THE MESSAGE" 
POINT-TO-POINT? 
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YES 



S.D. 



CALL THE POINT-TO-POINT 
PROTOCOL ENGINE AND 
SEND THE MESSAGE POINT- 
TO-POINT 



NO 



508 



CALL THE RELIABLE 
BROADCAST ENGINE 



510 



ADD SEQUENCE NUMBERS AND TIB 
CHANNEL NUMBERS TO PACKET 
HEADERS AND SEND ANY NECESSARY 
MESSAGES TO SUBSCRIBER PROCESS 



END 



WRITE ALL PACKETS WITH THEIR 
SEQUENCE NUMBERS AND TIB CHAN- 
NEL NUMBERS TO A RETRANSMIT MEM- 
ORY IN CASE ONE OR MORE PACKETS 

NEED TO BE RETRANSMITT ED 



CONT'D FIGURE 21 B 

FIGURE 21 A 
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FROM FIGURE 21 A 



51 



SEND MESSAGES TO ALL 
SUBSCRIBING PROCESSES FOR 
THIS TIB CHANNEL TO LISTEN FOR 
DATA ON THEIR REQUESTED 
SUBJECT ON TIB CHANNEL XX 
WHERE XX IS THE TIB CHANNEL 
ASSIGNED TO THIS SUBJECT 



TRANSMIT PACKETS VIA 
STANDARD BROADCAST 
PROTOCOLS OF TRANSPORT 
LAYER 



FIGURE 21 B 
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COMMUNICATION LAYER OF 
SUBSCRIBING PROCESS NOTIFIED 
OF ARRIVING PACKETS BY 
TRANSPORT PROTOCOL AND 
RETRIEVES PACKETS FROM 
INTERPROCESS TRANSFER 
MECHANISM 
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IF TIB CHANNEL NUMBER IN 
PACKET HEADERS MATCH AN 
OPEN SUBSCRIPTION BY A CLIENT 
PROCESS. THE RELIABLE 
BROADCAST PROTOCOL ENGINE 
OF DAEMON CHECKS RELIABILITY 
SEQUENCE NUMBERS AGAINST 
DATA REGARDING SEQUENCE 
NUMBERS THAT MAKE UP THE 
ENTIRE MESSAGE AND ERROR 
CHECKING IS PERFORMED IN • 
SOME EMBODIMENTS 



RECEIVING COMMUNICATION 
LAYER SENDS MESSAGE BACK TO 
TRANSMITTING COMMUNICATION 
LAYER SAYING EITHER THAT ALL 
PACKETS HAVE BEEN 
SUCCESSFULLY RECEIVED OR 
ASKING THAT CERTAIN PACKETS 
BE RETRANSMITTED 



X 

COMMUNICATION LAYER OF 
SERVICE OR DATA PUBLISHING 
PROCESS RETRANSMITS MISSING 
OR GARBLED PA CKETS 

_ ™ 

CONT'D FIGURE 22 B 

FIGURE 22 A 
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FROM FIGURE 22 A 



RECEIVING COMMUNICATION 
LAYER VERIFIES THAT 
REPLACEMENT PACKETS HAVE 
BEEN CORRECTLY RECEIVED AND 
ACKNOWLEDGES THIS FACT TO 
THE COMMUNICATION LAYER OF 
THE SERVICE OR DATA 
PUBLISHING PROCESS OR AGAIN 
REQUESTS RETRANSMISSION OF 
MISSING OR GARBLED PACKETS 
AND PROCESS REPEATS UNTIL 
ALL PACKETS SUCCESSFULLY 
RECEIVED AND FINAL 
ACKNOWLEDGMENT MESSAGE IS 
SENT 



528 



TRANSMITTING RELIABLE 
BROADCAST PROTOCOL ENGINE. 
UPON RECEIPT OF FINAL 
ACKNOWLEDGMENT MESSAGE OF 
SAFE RECEIPT OF ALL PACKETS, 
FLUSHES PACKETS FROM 
RETRANSMISSION MEMORY 
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21 



RECEIVING PROTOCOL ENGINE 
CHECKS TIB CHANNEL DATA 
AGAINST SUBSCRIPTION LIST AND 
PASSES POINTER TO SERVICE 
DISCIPLINES LINKED TO ALL 
SUBSCRIBING PROCESSES 
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!1 



MESSAGES PASSED THROUGH 
SERVICE AND INFORMATION 
LAYERS TO SUBSCRIBING 
PROCESSES 



FIGURE 22 B 
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-So/20 



600-^ 




608-v 
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SERVICE DISCIPLINE OF DATA 
PUBLISHING PROCESS RECEIVES 
A SUBSCRIBE REQUEST, PASSES 
IT THROUGH INFORMATION LAYER 
TO SERVICE AND ENTERS 
SUBSCRIPTION IN SUBSCRIPTION 
LIST 


IF THE NUMBER OF SUBSCRIBERS 
IS ABOVE THE CUTOFF NUMBER 
ESTABLISHED BY THE COST 
FUNCTION. CALL THE PROTOCOL 
ENGINE FOR RELIABLE 
BROADCAST AND PUT THE 
MESSAGE IN AN INTERPROCESS 
TRANSFER MECHANISM TO THIS 
PROTOCOL ENGINE 


60^ 


r 




MESSAGE PUBLISHED BY SERVICE 
THROUGH INFORMATION LAYER 
TO SERVICE LAYER 


IF NOT. CALL THE POINT-TO-POINT 
PROTOCOL ENGINE AND PUT THE 
MESSAGE IN AN INTERPROCESS 
TRANSFER MECHANISM TO THIS 
PROTOCOL ENGINE 


604^. 


r 




61 2-. 
r A 


SERVICE DISCIPLINE OF SUBJECT 
LAYER DETERMINES HOW MANY 
SUBSCRIBERS THERE ARE FOR 
THIS SUBJECT, AND ASSIGNS A 
TIB CHANNEL IF NECESSARY 


AWAIT NEXT MESSAGE OR 
SUBSCRIPTION AND RETURN TO 
STEP 600 IF A SUBSCRIPTION OR 
STEP 602 IF A MESSAGE 


606^ ^ 




SERVICE DISCIPLINE COMPARES 
NUMBER OF SUBSCRIBERS FOR 
THIS SUBJECT TO A 
PROGRAMMABLE CUTOFF 
NUMBER BASED UPON A COST 
FUNCTION OF COSTS FOR 
TRANSMISSION POINT-TO-POINT 
VERSUS BY RELIABLE 
BROADCAST 
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